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RECOMBINATION OF POLYNUCLEOTIDE SEQUENCES 
USING RANDOM OR DEFINED PRIMERS 

The U.S. Government has certain rights in this invention pursuant to 
Grant No. DE-FG02-93-CH10578 awarded by the Department of Energy and 
Grant No. N000 14-96- 1-0340 awarded by the Office of Naval Research. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to in vitro methods for 
mutagenesis . and ' recombination "of polynucleotide sequences. More 
particularly, the present invention involves a simple and efficient method for in 
uiiro mutagenesis and recombination of polynucleotide sequences based on 
polymerase -catalyzed extension of primer oligonucleotides, followed by gene 
assembly and optional gene- amplification. 

2. Description of Related Art 

The publications and other reference materials referred to herein to 
describe the background of the invention and to provide additional detail 
regarding its Dracticc are hereby incorporated by reference,. For convenience, 
the reference materials are numerically referenced and grouped in the 
appended bibliography. 

Proteins are engineered with the goa> of improving their performance 
for practical- applications. ... Desirable, properties depend on the application of ^ 
interest and may include tighter binding to a receptor, high catalytic activity, 
high stability, the ability to accept a wider (or narrower) range of substrates, or 
the ability to function in nonr.aturai environments such as organic solvents. A 
variety of approaches, including 'rational'- design and random mutagenesis 
methods, have been successfully used "to opumizc protein functions (1). The 
choice of approach for a given optimization problem will depend upon the 
degree of understanding of the relationships between sequence, structure and 
function. The rational redesign c: an er.tryrne catalytic site, for example, often 
requires extensive knowledge of the'entryrne structure, the structures of its 
complexes with various ligands and analogs of reaction intermediates and 
details of the catalytic mechanism. Such information is available only for a 
very few welt- studied systems; little is known about the vast majority ' of 
potentially interesting enzymes. Identifying the amino acids responsible for 
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existing protein functions and those which might give rise to new functions 
remains an often -overwhelming challenge. This, together with the growing 
appreciation that many protein functions are not confined to a small number 
of amino acids, but are affected by residues far from active sites, has 
prompted a. growing number of groups to turn to random mutagenesis, or 
'directed' evolution, to engineer novel proteins (1). 

Various optimization procedures such as genetic algorithms (2,3) and 
*. evolutionary, strategies (4,5) have been inspired by natural evolution. These 
procedures employ mutation, which makes small random changes in 
< members of the population, as well as crossover, which combines properties of 
different individuals, to achieve a specific optimization goal. There also exist- 
strong interplays between mutauon and' crossover, as shown by computer 
simulations of different optimization problems (6-9). Developing efficient and 
practical experimental techniques to mimic these key processes is a scientific 
challenge. The application of such techniques should allow one, for example, 
to explore and optimize the functions of biological molecules such as proteins 
and nucleic acids, in vivo or even completely free from the constraints of a 
living system (10,1 1). 

Directed evolution, inspired by natural evolution, involves the 
generation and selection or screening .of a pool of mutated molecules which 
has sufficient diversity for a' molecule encoding a protein with altered or 
enhanced -function- to be present therein, it generally begins with creation .of a' 
library *of mutated genes. Gene products which show improvement with, 
respect to . the desired property or set of properties arc identified by selection 
or screening. Thc.gcne(s) encoding those products can be subjected to further 
cycles of the process in order to. accumulate beneficial mutations. This 
evolution can involve few or .many generations, depending on how far one 
wishes to -progress and the effects of mutations typically observed in each 
generation. Such approaches have been used to create novel functional 
' nucleic acids (12), peptides and other small molecules (12), antibodies (12), as 
well as . enzymes and other proteins (13,14,16). Directed evolution requires 
little specific knowledge about the product itself, only a means to evaluate the ■ 
function to be optimized. These procedures arc even fairly tolerant to 
'inaccuracies and noise in the function evaluation (15). 

The diversity of genes for directed evolution can be created by 
introducing new point mutations using a variety of methods, including 
mutagenic PCR (15) or combinatorial cassette mutagenesis (16). The ability to 
recombine genes, however, can add an important dimension to the 
evolutionary process, as evidenced by its key role in natural evolution. 
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Homologous recombination is an important natural process in which 
organisms exchange genetic information between related genes, increasing the 
accessible genetic diversity within a species. While introducing potentially 
powerful adaptive and diversification competencies into their hosts, such 
pathways also operate at very low efficiencies, often eliciting insignificant 
changes in pathway structure or function, even after tens of generations. 
Thus, while such mechanisms prove beneficial to host organisms/species over 
geological time spans, in vivo recombination methods represent cumbersome, 
if not unusable, combinatorial processes for tailoring the performance of 
enzymes or other proteins not strongly linked to the organism's intermediary 
metabolism and survival. 

Several groups have recognized the utility of gene recombination in 
directed evolution. Methods for in vivo recombination of genes are disclosed, 
for example, in published PCT application V/O 97/07205 and US Pat. No. 
5,093,257. As discussed above, these in vivo methods are cumbersome and 
poorly optimized for rapid evolution of function. Stemmer has' disclosed a 
method for in vitro recombination of related DNA sequences in which the 
parental sequences are cut into fragments, generally using an enzyme such as 
DNasc I, and are reassembled (17,18,19). The non-random DNA 
fragmentation associated with DNase 1 and other endonuclcases, however, 
introduces bias into the recombination and limits the recombination diversity. 

Furthermore this method is limited to recombination of double-stranded 
polynucleotides and cannot be used on single-stranded templates. Further, 
this method docs no: work well with certain combinations of genes and 
primers. It is not efficient for recombination of short sequences (less than 200 
nucleotides (nts)) t for example. Finally," it is quite laborious, requiring several 
steps. Alternative, convenient methods- for creating 'novel genes by point 
.mutagenesis 'and recombination in vitro are needed. 

SUMMARY OF THE INVENTION 
The present invention provides a new and significantly improved 
approach to creating novel polynucleotide sequences by point mutation and 
recombination in vitro of a set of parental sequences (the templates). The 
* novel polynucleotide sequences can be useful in themselves (for example, for 
DNA-based computing), or they can be expressed in recombinant organisms 
for directed evolution of the gene products. One embodiment of the invention 
involves priming the template gene(s) with random-sequence oligonucleotides 
to generate a pool of short DNA fragments. Under appropriate reaction 
conditions, these short DNA fragments can prime one another based on 
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complementarity and thus can be reassembled to form full-length genes by 

repeated thermocycling in the presence of thermostable DNA polymerase. 

These -reassembled genes, which contain point mutations as well as novel 

combinations of sequences from different parental genes, can be further 
..-amplified by conventional PCR and cloned into a proper vector for expression 

of the encoded proteins. Screenihg.'or selection of the gene products leads to 
. new variants with improved or even novel .functions.. These variants can be - 

used as they are, or they can- serve as new starting points for further cycles of 
. . mutagenesis and recombination. • 

A second embodiment of the invention involves" priming the template 
: gene(s) with a set of . primer oligonucleotides of defined sequence' or defined- 

sequence exhibiting limited . randomness to generate a pool of short DNA 
• fragments, which are then, reassembled as described above into full length 

genes.- . " ( . 

. A third embodiment of the invention involves a novel process we term 
the 'staggered extension* process, or StEP.. Instead of reassembling the pool of 
fragments created by the extended primers, full-length genes are assembled 
directly' in the presence of 'the tcmplate(s). The StEP consists of repeated 
cycles of denaturation followed by extremely abbreviated annealing/extension 
steps. In each cycle the extended fragments can anneal to different templates ■ 
*' based on complementarity and extend a little further to create "recombinant 
cassettes." ' Due to .this, template switching, most of the polynucleotides 
contain -sequences from different parental genes (i.e. are novel recombinants). 
This process is repeated until full-length genes .form.. It can be followed by an' 
. optional gene amplificauon-step. . 

The different embodiments of 'the- invention provide features and ■ 
advantages, for different applications. In the most preferred embodiment, one 
or more, defined primers or defined primers exhibiting' limited randomness 
which, correspond to .or ...flank the .5' and 3* T ends of • the template 
■polynucleotides are used with StEP to generate .gene fragments which grow 
into the. novel full-length sequences. This simple .method requires no 
knowledge of the template sequencers). ' . 

In another preferred embodiment, multiple defined primers or defined 
primers exhibiting limited randomness are used to generate short gene 
fragments which are reassembled into full-length genes. Using multiple 
defined primers allows the user to bias in ii:ro recombination frequency. If 
seauence information is available, primers can be designed to generate 
• overlapping recombination cassettes which increase the frequency of 
recombination at particular locations. Among other features, this method 
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introduces the flexibility to take advantage of available structural and 
functional information as well as information accumulated through previous 
generations of mutagenesis and selection (or screening). 

In addition to recombination, the different embodiments of the primer- 
based recombination process will generate point mutations. It is desirable to 
know and be able to control this point mutation rate, which can be done by 
manipulating the conditions of DNA synthesis and gene reassembly. Using 
the defined-primer approach, specific point mutations can also be directed to 
specific positions in the sequence through the use of mutagenic primers. 

The various primer-based recombination methods in accordance with 
this invention have been shown to enhance the activity of Actinoplanes 
utahensis ECB deacylase over a broad range of pH values and in the presence 
of organic solvent and to improve the thermostability of Bacillus subtilis 
subtilisin E. DNA sequencing confirms the role of point mutation and 
recombination in the generation of novel sequences. These protocols have 
been found to be both simple and reliable. . 

The • above discussed and many other features and attendant 
advantages 1 will . become better understood by reference to the following, 
detailed description when taken in conjunction with the accompanying 
drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
PEG. 1 deoicts .recombination in accordance with the present invention 
using random-sequence primers and gene reassembly. The steps shown arc: 
a) Synthesis of single-stranded DNA fragments, using mesop'nilic or 

thermophilic- -polymerase- wish- random-sequence -oligonucleotides, .as-primcrs 

[primers not shown); b) Removal of templates; c) Reassembly with 
thermophilic DNA polymerase; d) Amplification -with thermostable 
polymerase(s); e) Cloning, and Screening (optional); and f) Repeat the process 
with selected gcne(s)^optionai). " • ; * 

FIG. 2 depicts recombination in accordance with the present invention 
using defined primers. The method is illustrated' for the recombination of two 
genes, where x = mutation.. The steps diagrammed arc:- a) The genes are. 
primed with defined primers in PCR reactions that can be done separately (2 
primers per reaction) or combined (multiple primers per reaction); c) Initial 
products are formed until defined primers are exhausted. Template is 
removed (optional) ; d) initial fragments prime and extend themselves in 
further cycles of PCR with no addition of external primers. Assembly 
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continues until full-length genes are formed; e) Ipjmpnall Full-length genes are 
amplified in a PCR reaction with external primers; f) (optional) Repeat the 
process with selected gene(s). 

FIG. 3 depicts recombination in accordance with the present invention 
using two defined flanking primers and StEP. Only one primer and two single 
strands from two templates are shown here to illustrate the recombination 
. process. The oudined steps are: a) After denaturation, template genes are 
primed with one defined primer; b) Short fragments are produced by primer 
extension for a short time; c) In the next cycle of StEP, fragments are 
randomly primed to the templates and extended further; d) Denaturation ar.d 
annealing/ extension is repeated until full-length genes are made (visible on an 
agarose gel); e) Full-length genes are purified, or amplified in a PCR reaction 
with external primers (optional); 0 (optional) Repeat the process with selected 
gene(s). 

FIG 4 is a diagrammatic representation of the results of the 
recombination of two genes using two flanking, primers and staggered 
extension in accordance with the present invention/ DNA sequences of Ave 

rn-i-v are indicated, where x is a mutauon 
genes chosen from tnc rccornbir.c- Krj.a-jr a^einaic 

• >u 'ar.d the triangle represents a new point 

sent in the parental gen^s, a..- ^ o 



pre 
mutation 



-PIG. 5 is a diagrr.mrr.auc representation of the sequences of the P N3 

™ A.^^a Sample 3. Template genes 2-13 and 5-B12 were 
■esterase genes des^..^^ .... ......^ 

. • , r - n - a~>'„„~ rHm~r approach. "The positions- of the primers . 

: recbmbined using Lne. p.-*.-- 

■ a' n~ 0 -s and the oosidons %vhcrc the parental sequences differ 

are indicated b> a.*o..o, — v 

from one another are indicated by xs. New point mutations are indicated by 
triangles Mutations identified in these recombined genes are listed (only 
positions which differ in the parental sequences are listed). Both 6E6 and 
6H 1 arc recombination products of the template genes. 

FIG. 6 shows the positions and sequences of the four defined internal 

.„.„ r . rr ._' 3 :-.,.- rcnes from template genes Rl and R2 
• primers used to generate rcco...o b- 

by interspersed primer-based recombination. Primer P50F contains a 
nutation (A->T at base position 59S) which simultaneously eliminates a 
Hindlll restriction sice ar.d adds a new unique Nhel site. Gene R2 also 
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contains a mutation A->G at the same base position, which eliminates the 
Hindlll site. 

FIG. 7 is an electrophoresis gel which shows the results of the 
restriction-digestion analysis of plasmids from the 40 clones. 

FIG. 8 shows the results of sequencing ten genes from the defined 
primer-based recombination library. Lines represent 986-bp of subtilisin E 
gene including 45 nt of its prosequence, the entire mature sequence and 113 
nt after the stop codon. Crosses indicate positions of mutations from parent 
gene Ri and R2, while triangles indicate .positions of new point mutauons 
introduced during the recombination procedure. Circles represent the 
mutation introduced by the mutagenic primer P50F. 

FIG. 9 depicts the results of applying the random-sequence primer 
recombination method to the gene for A ctino planes utahensis ECB deacylase. 
(a) The 2.4 kb ECB deacylase gene was purified from an agarose gel. (b) The 
size of the random priming products ranged from 100 to 500 bases, (c) 
Fragments shorter than 300 bases v.' ere isolated, (d) The purified fragments 
were used to reassemble the full-length gene with a smear background, (e) A 
. single PCR product of the same sLzc as the EC3 deacylase gene was obtained 
after conventional PCR with the two primers located at the start and stop 
regions of this gene. (0 After digestion with Xho I and Psh AI, the PCR product 
was cloned into a modified p:J702 vector to form a mutant library. (g) 
Introducing this library into Slrepiomyces iiuidans TK23 resulted in 
approximately 71% clones producing the active .ECB deacylase. . . . . 

FIG. 10 shows the specific activity of the wild -type ECB deacylase and 
mutant M16 obtained in accordance with the present invention. 

FIG. 1 1 shows pH profiles of activity of the wild -type ECB deacylase 
and mutant M16 obtained in accordance with the. present invention. 

FIG. 12 shows the DNA sequence analysis of 10 clones randomly chosen 
from the library/ Kienow. Lines represent 985-bp of subtilisin E gene including 
45 nt of its prosequence, the entire mature sequence and 1 13 nt after the stop 
codon. Crosses indicate positions of mutations from Rl and R2, while 



WO 98/42832 



PCT/US98/059S6 



• - - f 

triangles indicate positions of new point mutations introduced during the 

random- priming recombination process. 

FIG. 13 Thermostability index profiles of the screened clones from the 
five libraries produced using different polymerases: a) library/ Kle now, b) 
Library/T4, c) library/ Sequ en ase, d) Jibrary/Stoffel and e) iibrary/Pfu. 

Normalized residual activity" (Ar/Ai) after incubation at 65°C was used as an 
index of the . enzyme thermostability. Data were sorted and plotted in 
descending order. • ~ - . , 

DETAILED DESCRIPTION OF THE INVENTION 
In one preferred embodiment of the present invention, a. set of primers 

with all possible nucleotide sequence combinations (dp(N)L where L - .primer 
length) is used for , the primer-based recombination. It has been known for 
years that oiigodeoxynucleoUdcs of different lengths can serve as primers for 
initiation of DNA synthesis on single-stranded templates by the Kienow 
fragment of E.coli polymerase I (21). Although they are smaller than the size 
of a normal' PCR primer (i.e.. less thai: 13 bases), oligomers as short as 
hexanucleotid.es can' adequately prime the reaction and axe frequently used in 
labeling- reactions (22). The use of random primers to create a pool of gene 
fragments followed' by gene reassembly in accordance' with the invention is 
shown in" FIG. 1. The steps include generator, of diverse "breeding blocks" 
from lhc single- stranded polvnucieotidc templates through random priming, 
reassembly of the full-length DNA from the generated short, nascent DNA 
fragments by uSerrno eye ling in the presence of DNA polymerase, and 
nucleotides, and amplification' of the desired genes from the reassembled 
products by conventional PCR for further cloning ' and screening. This 
procedure' introduces, new mutations mainly at the priming step ' but also 
during other steps. These new mutations and the mutations already present 
in the template sequences are reccmbincd curing reassembly to create 'a 
library of novel DNA sequences. The process can be repeated on the selected 
sequences, if desired. 

To, carry out the random priming procedure, the template(s)* can be 
single- or denatured double-stranded polynucleonde(s) in linear or closed 
circular form. The templates can be mixed in equimolar amounts, or In 
amounts weighted, for example, by their functional attributes. Since, at least 
in some cases, the template ger.es are cloned ;n vectors into which no 
additional mutations should be Introduced, they are usually first cleaved with 
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restriction endonuclease(s) and purified from the vectors. The resulting linear 
.DNA molecules are denatured by boiling, annealed- to random-sequence,, 
oligodeoxynucleotides and incubated with DNA polymerase in the presence of 
an appropriate amount of dNTPs. Hexanucleotide primers are preferred, 
although longer random primers (up to 24 bases) may also be used, depending 
on the DNA ' polymerase and conditioning used during random priming 
synthesis. - Thus the oligonucleotides prime the DNA; of interest at various 
positions along the entire target region and are extended to generate short 
DNA fragments complementary to each strand of the template DNA. Due to 
events such as base mis-incorporations and mispriming, these short DNA 
fragments also contain point mutations. Under routinely established reaction, 
conditions, the short DNA . fragments can prime one another based on 
homology and be reassembled into . full-length genes by repeated 
thermocycling in the presence of thermostable DNA polymerase. The resulting 
full-length genes will have'diverse sequences/most of which, however, still' 
resemble that of the original template DNA. These, sequences can be further 
amplified by a conventional ?CR and cloned into a vector for expression. 
"Screening or"selectipn of the expressed mutants should lead to variants with 
improved or even new specific functions/ These variants can.be immediately 
used as partial solutions to a "practical problem, or they can serve as new 
starting points for further cycles'.of directed evolution.^ 

Compared to other techniques used for protein optimization, such as 
combinatorial cassette and oiigonucleotidc-dirccted -mutagenesis (24,25,26), 
error-prone ?CR (27, 23], or DNA shuffling (1.7,18,19), some of the advantages 
of -the -random-primer„base.c\.proccdure^ for_ in J vitro ..protein ^evolution are_ 

summarized as follows: , 

1. The template's) used for random- priming svmthesls may' be either 
single- or double-stranded polynucleotides. In contrast, error-prone PCR and 
the DNA shuffling method for recombination (17,18,19). necessarily . employ 
only double-stranded polynucleotides. .-Using the technique described here, 
mutations and/or crossovers car.- be introduced at the DNA level by using 
different DNA-dependent DNA polymerases, or even directly from mRNA by 
using different RNA-ccpendent DNA polymerases. Recombination can be 
performed using single -stranced DNA templates. 

2. In contrast to the DNA shuffling procedure, which requires 
fragmentation of .the double-stranded DNA template (generally done with 
DNAse I) to generate random fragments, the technique described here employs 
random priming * synthesis to obtain DNA fragments of controllable size as 
"breeding blocks" for further reassembly (FIG. 1). One immediate advantage is 
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that two sources of nuclease activity (DNase I and 5'-3' exonuclease) are 
eliminated, and this allows easier control over the size of the final reassembly 
and amplification gene fragments. 

3. Since the random primers are a population of synthetic oligo- 
nucleotides that contain ail four bases in every position, they are uniform in 
their length and lack . a sequence bias. The sequence heterogeneity allows 
them to 'form hybrids with the template DNA strands at many positions, so 
that every nucleotide of the template (except, perhaps, those at the extreme 5' 
terminus) should be copied at a similar frequency into products. In this way, 
both mutations and crossover may happen more randomly than, for example, 
with error-prone PCR or DNA shuffling, 

4.. The. random-primed DNA synthesis is based on the hybridization of a 
mixture of ' hexanucleotides to the DNA templates, and the complementary 
strands are- synthesized from the 3"-OH termini at the random hexanucleotide 
primer using polymerase and the four deoxynucleotide triphosphates. Thus 
the reaction is independent of the length of the DNA template. DNA fragments 
of 200 bases length can be primed equally well as linearized plasmid or X DNA 
(29). This is particularly useful for engineering peptides, for example. 
5. Since DNase' I is an endonuclease that hydrolyzes double-stranded 
DNA preferentially at sites adjacent to pyrimidinc nucleotides, its use in DNA 
shuffling may result in bias (particularly for genes with high G+C or high A+T 
contcntj'at the step of template gene digestion. Effects of this potential bias 
on the- overall-mutation rate and recombination frequency may.be avoided by- 
using *the random-priming approach. Bias in random priming due to 
preferential hybridization to GC-rich regions of the template DNA could be 
overcome by increasing the A and T content in the random oligonucleotide 
library. 

An important part of practicing the present invention is controlling the 
average size of the nascent, single-strand DNA synthesized during the random ' 
priming process. This step has been srudied in detail by others. Hodgson and 
Fisk (30) found that the average size of the synthesized single-strand DNA is 
an inverse function of primer concentration: length - V:/ vMnPc , where P c is 
the primer concentration. The inverse relationship between primer concen- 
tration and output DNA fragment size may be due to steric hindrance. Based 
on this guideline, proper conditions for random-priming synthesis can be 
readily set for individual genes of different lengths. 
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Since dozens of polymerases are currently available, synthesis of the 

• short, nascent DNA fragments can be achieved in a variety of fashions. For 
example, bacteriophage T4 DNA polymerase (23) or T7 scquenase version 2.0 
DNA polymerase (31,32) can be used for the random priming synthesis. 

For single-stranded polynucleotide templates (particularly for RNA ■ 

• templates), a reverse transcriptase is preferred for random-priming synthesis. 
Since this enzyme lacks^'-^' exonuciease activity, it is rather prone to error. 

In the presence of high concentrations of dNTPs and Mn 2+ , about 1 base in 

every 500 is misincorporated (29). . ' 

By modifying the reaction conditions, the PCR can be adjusted for the 
random priming 'synthesis using thermostable polymerase for the short, 
nascent DNA fragments. An 'important consideration is to identify by routine 
experimentation the reaction conditions which ensure that the short random 
primers can anneal to the templates and give sufficient DNA amplification at 
higher temperatures. V/e have found that random primers as short as dp(N)io 
can be used with PCR to generate the extended primers. Adapting the PCR to 
the random priming synthesis pro-ices a convenient method to make short, 
nascent DNA fragments and ma>es this random priming recombination 
technique very robust. 

In - many evolution scenarios, recombination should be conducted 
between oligonucleotide sequences for which sequence information is available 
for' at least some of the template sequences. In such scenarios, it is often 
possible to define and synthesize a scries of primers which arc interspersed 
• between the various mutaticr.s. When defined primers arc used,, they can be 
' between 6 and 100 bases long. In accordance- with tine present invention, it 
was discovered that by allowing these defined primers to initiate, a series of 
overlapping primer extension reactions (which may be facilitated by 
tin ermocy cling), it is possible to generate recombination cassettes each 
containing one or more of' the accumulated mutations, allelic or isotypic 
differences between templates. Using the defined primers in such a. way that 
overlapping extension products are generated in the DNA polymerization 
reactions, exhaustion of available primer leads to the progressive cross- 
hybridization of primer extended products until complete gene products arc 
generated. The repeated rounds of annealing, extension and denaturation 
assure recombination of each overlapping cassette with every other. 

A preferred embodiment of the present invention involves methods in 
which a set of defined oligonucleotide primers is used to prime DNA synthesis. 
FIG. 2 illustrates an exemplar/ version of the present invention in which 
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defined primers are used. Careful design and positioning of oligonucleotide 
primers facilitates the generation of non-random extended recombination 
primers and is used to determine the major recombination (co-segregation) 
events along the length of homologous templates. 

Another embodiment of the present invention is an alternative ' 
approach to primer-based gene assembly and recombination in the presence 
of template. Thus, as illustrated in FIG. 3, the present invention includes 
recombination in. which enzyme -catalyzed DNA polymerization is allowed to 
proceed only briefly (by limiting the time and lowering the temperature of the t 
extension step) prior to denaturation. • Denaturation is followed by random 
annealing of the extended fragments to template sequences and continued 
partial extension. This process is repeated multiple times, depending on the 
concentration of primer and template, until full length sequences arc made. 
This process. is called staggered extension, or StEP. "Although random primers 
can also be used for StEP, gene synthesis is not nearly as efficient as with 
defined primers. Thus defined" primers are preferred. 

In this method,' a brief annealing/extension step(s) is used to generate 
the partially extended primer. A typical annealing/extension step is done 
under conditions which allow high .fidclir/ primer annealing (Tannealing greater 
than Tm*~ 5 ). but limit the polymerization/extension to. no more than a few 
seconds (or an average extension to less than 300 nts). Minimum extensions 
are preferably on the order of 20-50 nts. It has been demonstrated that 
thermostable DNA polymerases typically exhibit maximal pol>Tncrization rates 
cf 100-150 nucieo'tidcs/second/cnzyme molecule at optimal -tempcrarures, 
but follow approximate Arrheiiius kinetics at tempcraaires approaching the 
optimum temperature (Top;)- Thus, at a temperature of 55°C, a thermostable 
polymerase exhibits only 20-25% of the steady state polymerization rate that it 
exhibits at. 72'C (Topi), or 24 nts/second (40). At 37°G and 22'C, Tag' 
polymerase is reported to have extension activities of - 1.5 and 0.25' 
nts/second, respectively (24). Both time -and temperature can be routinely 
altered based on the desired recombination events and knowledge of basic 
polymerase kinetics and biochcm:s:ry. 

The progress of the staggered extension process is monitored by 
removing aiiquots from the reaction rube at various time points, in the primer 
extension and separating DNA fragments by agarose gel electrophoresis. 
Evidence of effective primer extension is seen from the appearance of a low 



molecular weight 'smear' early in the process which increases in molecular 
weight with increasing cycle number. 

Unlike the gene amplification process (which generates new DNA 
exponentially), StEP generates new DiNA fragments in an additive manner in 
its early cycles which contain DNA segments corresponding to the different 
template genes. Under non-amplifying conditions, 20 cycles of StEP generates 
a maximal molar yield of DNA of approximately 40 times the initial template 
concentration. In comparison, the idealized polymerase chain reaction 
process for gene amplification is multiplicative throughout, giving a maximal 
molar yield of approximately 1 x 10 6 -fold through the same number of steps. 
In practice, the difference between the two processes can be observed by PCR, 
giving a clear *band' after only a few (less than 10) cycles when starting with 
template at concentrations of less than 1 ng/ul and primers a; I0-500-fo!d 

excess (vs. 10 6 -fold excess typical of gene amplification). Under similar 
reaction conditions, the StEP would be expected to give a less visible 'smear', 
which increases in molecular weight with increasing number of cycles. When 
significant numbers' of primer extended DNA molecules begin to'rcach sizes of 
greater than 1/2 the length of the full length gene, a rapid jump in molccular 
weight occurs, as half- extended forward and reverse strands begin to cross - 
hybridize to generate fragments nearly 2 Limes the size of those encountered 
to that point in the process. At this point, consolidation of the smear into a 
discrete band of the appropriate molecular weight can occur rapidly by cither 
continuing to subject the DNA to StEP, or altering the thcrmccyclc to allow 
complete extension of the primed DNA to drive exponential gene amplification. 

..Following, gene assembly .(and, . .if.. necessary, conversion- to - double 
stranded form) recombincd ^ cr.es are amplified (optional), digested with 
suitable restriction enzymes and ligated into expression vectors for screening 
of the expressed gene products. The process can be repeated if desired, in 
order to accumulate sequence changes leading to the evolution of desired 
functions. 

The staggered extension and homologous gene assembly process (StEP) 
represents a powerful, flexible method for recombining similar genes in a 
random or biased fashion. The process can be used to concentrate 
recombination within or away from specific regions of a known series of 
sequences by controlling placement of primers and the time allowed for 
annealing/extension steps. It can also be used to recombine specific cassettes 
of homologous genetic information generated separately or within a single 
reaction. The method is also applicable to recombining genes for- which no 
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sequence information is' available but for which functional 5' < and 3' 
amplification primers can be prepared. Unlike other recombination methods, 
the staggered extension process can be run in a single tube using 
conventional procedures without complex separation or. purification steps. 

Some' of the advantages of the defined -primer embodiments of the 
present invention are summarized as follows: 

1. The StEP method does not require separation of parent molecules from 
assembled products. 

2. Defined' primers, can be used to bias, the location of recombination 
events. 

3. StEP allows the recombination frequency to. be adjusted by varying 
extension times. 

4. The recombination process can be carried out in a single tube. > . ■ 

5. The process can be. carried out on single-stranded or double-stranded 
polynucleotides. . 1 

6. ' The process avoids the bias introduced by DNase I' or other 
cndonucleases. .* 

7. Universal primers can be used. " 

8. Defined primers exhibiting limited randomness can be used to increase 
the frequency of mutation at selected areas of the gene. \ 

As will be appreciated by those stalled in the art, several embodiments 
of the present invention are possible. Exemplary embodiments include: 

1. Recombination and point mutation of related genes using only defined 
'Ranking primers and staggered extension. 

2. Recombination and mutation of related genes using flanking primers 
and a series of internal primers at. low enough concentration that exhaustion 
of the primers will occur- over the course of the the rmocy cling, forcing the 
overlapping gene fragments to cross-hybridize and extend until recombined 
synthetic genes are formed. 

3. Recombination ar.d mutation of genes using random-sequence primers 
at high concentration to generate a pool of short DNA fragments which arc 
reassembled to form new genes. 
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4. . Recombination and mutation of genes using a set of defined primers to 
generate a pool of DNA fragments which are reassembled toform new genes. 

5. Recombination and mutation of single-stranded polynucleotides using 
one or more defined primers and staggered extension to form new genes. 

6. Recombination using defined primers with limited randomness at more 
than 30% or more than 60% of the nucleotide positions within the primer. 

Examples of practice showing use of the primer-based recombination 
method are as follows. ' • 

EXAMPLE 1 

Use of defined flacking primers and staggered extension to 
recombine and enhance the thermostability of subtilisin E 

This example shows how the defined primer recombination method can 
be used to enhance the thermostability of subtilisin E by recombination, of two 
genes known to encode subtilisin E variants with thermostabilities exceeding 
that of wild-type subtilisin E. This example demonstrates the general method 
outlined' in FIG. 3 utilizing only two primers corresponding to the 5' and 3' 
ends of the templates. 

As outlined in FIG. 3, extended recombination primers arc first 
generated by the staggered extension process (StEP), which consists of 
reseated eveics of denaturati'on followed by extremely abbreviated 
annealing/ extension s:c?(s). The extended fragments arc reassembled into 
full-length genes by ■ ■thermocr.-clLng-assistcd homologous gene assembly in. the 
presence of a DNA polymerase, followed by an optional gene amplification 
step. 

Two thermostable subtilisin E mutants Rl and R2 were used to test the 
defined primer based recombination technique using staggered extension. 
The positions at which these two genes differ from one another are shown in 
Table 1. Among the ten nucleotide positions that differ in Rl and R2, only 
those mutations leading to amino acid substitutions Ash 181-Asp (NISID) and 
Asn 213-Ser (N218S) confer thermostability. The remaining mutations arc 
neutral with respect to their effects cn thermostability (33).. The half-lives at 
65°C of the single variants NISID and N21S5 arc approximately 3-fold and 2- 
fold greater than that of wild type subtilisin E, respectively, and their melting 
temperatures, T a are 3.7°C and 3.2'C higher than that of wild type enzyme, 
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respectively. Random recombination events that yield sequences containing 
both these functional mutations will give rise to .enzymes whose half lives at 
65*C are approximately 8-fold greater than that of wild type subtilisin E, 
provided no new deleterious mutations are introduced into these genes during 
the recombination process. Furthermore, the overall point mutagenesis rate 
associated with the recombination process can be estimated from the catalytic 
. - activity profile of a small sampling of the recombined variant library. If the 
point mutagenesis rate is zero, 25% of the population should exhibit wild type - 
like activity, 25% of the population should have double mutant 
(N181D+N218S)-like activity and the remaining 50% should have single 
mutant (N181D or N218S) : Iike activity. Finite point mutagenesis increases 
the fraction of the library that encodes enzymes with wild -type like (or lower) 
activity. This fraction can be used to estimate the point mutagenesis rate. 

' TABLE 1 ' , 

DNA and amino acid substitutions in thermostable 
subtilisin E mutants Rl and R_2. 







Base 


Position 




Amino acid 


Gene 


Base 


Substitution 


in codon .■ 


Amino acid 


substitution 




780- 


A — ► G 


■ . 2 


109 


Asn->Scr 


R 1 


1107 


A -> G 


2 


. 218 


Asn->Ser 




114 1 


A — » T* 


3 ■ 


229 


'synonymous' 




1153, 


A - G 


3 .' 


233 


synonymous 




484 


' A - G 


■ 3 


10 


synonymous 




; 520 


A -» T . 


3 * 


22 .. 


synonymous 




593 . 


■ A — 9 G 


3 


48 


synonymous 




.731 


G -> A 


1 


93 


Val-»Ile 


R2 


745 ; 


' T -> C . 




; 97. 


synonym'ous 




■ 780 . 


A-+G 


2 


109 • 


Asn->Scr 




995 : 


A -* G 


' X 


181"' 


Asn-»Asp 




1189 


A -> G 


. 3 


24 5 


synonymous 


Mutations 


listed are 


: relative to' wild 


type subtiiis 


;iri E with base 


substitution at 


780 in common. 
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Materials and Methods 

Procedure for defined primer based recombination using two flanking primers. 

Two defined primers, P5N (5'-CCGAG CGTTG CATAT G TGOA AG-3' 
(SEQ. ID. NO: 1), underlined sequence is Ndel restriction site) and P3B (5'- 
CGACT CTAGA GGATC C GATT C-3' (SEQ. ID. NO: 2), underlined sequence is 
BamHI restriction site), corresponding to 5' and 3' Qanking primers, 
respectively, were used for recombination. Conditions (100 ul final volume): 
0.15 pmol plasmid DNA containing genes Rl and R2 (mixed at 1:1) were used 
as template, 15 pmol of each flanking primer, 1 times Taq buffer, 0.2 mM of 
each dNTP, 1.5 mM MgCh and 0.25 U Taq polymerase. Program: 5 minutes of 
95°C, 80 cycles of 30 seconds 94°C, 5 seconds 55°C. The product of correct 
size (approximately lkb) was cut from an 0.8% agarose gel after 
electrophoresis and purified using QIAEX II gel extraction kit. This purified 
product was digested with Ndel and BamHI and subcloned into p3E3 shuttle 
vector. This gene library was amplified in £. coli HB101 and transferred into 
B. subtilis D342S competent cells for expression and screening, as described 
elsewhere (35). , 

DnA sequencing 

Genes were purified using QiAprep spin plasmid miniprep ki: to obtain* 
sequencing quality DNA. - Sequencing was done on an A3I 373 DNA 
Sequencing System using the Dye Terminator Cycle Sequencing kit (Pcrkin- 
Eirr.cr, Branchburg. NJ). 

Results 

The progress of the staggered extension was monitored _by removing 
aiiquots (10 ulj from the reaction tube at various time points in the primer 
extension process and separating DNA fragments by agarose gel 
electrophoresis. Gel electrophoresis of primer extension reactions revealed 
that annealing/extension reactions of 5 seconds at 55 3 C resulted Ln the 
occurrence of a smear approaching 100 bp (after 20 cycles), 400 bp (after 40 
cycles), 800 bp (after 60 cycles) and finally a strong approximately 1 kb band 
within this smear. This banc (mixrurc of reassembled products) was gel 
purified, digested with restriction crjr.^e BamHI and Ndel, and ligated with 
vector generated bv BamHI- Ndel digestion of the E. coli / B. subtilis d3E3 
shuttle vector. This gene library was amplified in. £. coli H3101 and 
transferred into 3. subtilis D3423 competent cells for expression and 
screening (35). 
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The thermostability of enzyme variants was determined in the 96-weU 
plate format described previously (33). About 200 clones were screened, and 
approximately 25% retained subtilisin activity. Among these active clones, the 
frequency of the double mutant-like phenotype {high thermostability) was 
approximately 23%, the single mutant-like phenotype was approximately 42%, 
and wild type-iike phenotype was approximately 34%. This distribution is 
very close to the values expected when the two thermostable mutations N218S 
and N 1 8 1 D can recombine with each other completely freely. 

Twenty clones were randpmjy picked from £. coli HB101 gene library. 
Their plasmid DNAs were isolated and digested with.Ndel and BamHI. Nine 
out of 20 (45%) had the inserts of correct, size (approximately 1 kb). Thus, 
approximately 55% of the above library had no activity, due to lack of the 
correct subulisin E gene. These clones are not 'members of the subtilisin 
library and should be removed from our calculations. Taking into account 
this factor, we find that 55% of the library (25% active clones/45% clones with 
correct size, insert) retained subtilisin activity. This activity profile indicates a 
point mutagenesis rate of less than 2 mutations per gene (36); Five clones with 
inserts of the correct size were sequenced. The results are summarized in FIG; 
4. All five genes arc recombination products with minimum crossovers 
varying from 1 to 4. Only one new point mutation was found in these five 
genes. 

EXAMPLE 2 

Usc.of, defined flanking primers and staggered extension 
to recombine ph'B esterase mutants 

The two-primer recombination method used here for pN3 esterase is 

analogous to that described in Example 1 for subtilisin E. Two template pNB 

esterase mutant genes, that diner at 14 bases are used. Both templates (61C7 

and 4G4) are used in the plasmid form. Both .target genes are present in the 

extension reaction at a concentration of 1 ng/ul. Flanking primers (RM 1 A and 

RM2A, Table 2) are added at a final' concentration of 2 ng/ul (approximately 

200-fold molar excess over template;. 
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TABLE 2 

Primers used in the recombination of the pNB esterase genes 
Primer Sequence 

RM 1 A GAG CAC ATC AGA TCT ATT AAC (SEQ. ID. NO: 3) 

RM2A GGA GTG GCT CAC AGT CGG TGG (SEQ. ID. NO: 4) 

Clone 61C7 was isolated based on its activity in organic solvent and 
contains 13 DNA mutations vs. the wild-type sequence. Clone 4G4 was 
isolated for thermostability and contains 17 DNA mutations when compared 
with wild-type. Eight mutations are shared between them, due to common 
ancestry. The gene product from 4G4 is significantly more thermostable than 
the gene product from 61C7. Thus, one measure of recombination between 
the genes is the co-segregation of the high solvent activity and high 
thermostability or the loss of both properties in the recombined genes. In 
addition, recombination frequency and mutagenic rate can be ascertained by 
sequencing random clones. 

For the pN3- esterase gene, primer extension proceeds through 90 
rounds of extension with a thermocycle consisting of 30 seconds at 94*C 
followed by 15 seconds at 55 a C. Aliquots (10 \x\) are removed following cycle 
20, 40, 60, 70, SO and 90. Agarose gel electrophoresis reveals the formation 
of a .low -molecular vvcigh-t 'smear' by cycle 20, which increases in average size 
and overall intensiry a: each successive sample point. By cycle 90, a 
pronounced smear is evident emending from 0,5 kb to 4 kb, and exhibiting 
mar-dmal signal intensity a: a size of approximately 2 kb (the length of -the full 
length genes). The jump from half-length to fulHength genes appears to occur 
between cycles 60 and 70. 

The intense smear is amplified through 6 cycles of polymerase chain 
reaction to more clearly define the full length rccombined.gcne population. A 
minus-primer control is also amplified with flanking primers to determine the 
background due. to residual template in the reaction mix. Band intensity from 
the primer extended gene population exceeds that of the control by greater 
than. 10- fold, indicating that amplified, non'-recombined. template comprise 
only a small fraction of the amplified gene population. 

The amplified reepmbmed gene pool is digested with restriction 
enzymes Xbal and BamHI and ligated into the pNB106R expression vector 
described by Zock et aJ. (35). Transformation of ligated DNA into E. coli strain 
TGI is done using the' well characterized calcium chloride transformation 



WO 98/42832 



PCT/US98/05956 



V- : ■ 

procedure. Transformed colonies are' selected on LB/agar plates containing 
20 ug/ml tetracycline. ■. 

The mutagenic rate 'of the process is determined by .measuring the 
percent of clones expressing an. active esterase (20). In addition, colonies, 
picked at random are sequenced and used to define the mutagenic frequency 
of the method and the efficiency of recombination. . - . 

EXAMPLE 3 . . . 

Recombination of pNB esterase genes using interspersed' 
internal defined primers, and staggered extension , 

This example demonstrates that .the interspersed defined primer 
recombination technique can produce novel sequences through point 
' mutagenesis- and" recombination" of ' mutations- present in the parent 
sequences. ■ 

.•'Expcrimcntai design and background information 

, Two pNB" esterase genes (2-13 and 5-312) were recombined using the 
defined primer recombination technique.. Gene products" from both 2- 13 'and 
5 ,B12 arc measurably more Lhcrmostablc than wild-type. Gene 2; 13 contains 
9 mutauons' not.originaJly present in/ the wild-rype sequence, white gene 5- 
B12 contains 14.' The positions, a: which" these two genes differ from one 
another arc shown in FIG. 5. . . * • . 

. Table 3 shows, the sequences.of the eight primers used in this example. 
• Location (at the 5' end c: the .template gene] of oligo annealing to the template 
genes is indicated in the table, as is primer orientation (F indicates a forward 
•' primer, R indicates 'reverse). These primers are shown as arrows along gene 
2-13 in FIG. 5. • '■'•.,■•. 
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TABLE 3 

Sequences of primers used in this example 
name orientation location sequence 



RM1A 


F 


-76 


GAGCACATCAGATCTATTAAC (SEQ. ID. NO: 3) 


RM2A 


R 


. +454 


GGAGTGGCTCACAGTCGGTGG (SEQ. ID. NO: 4} 


S2 


F 


400 


TTGAACTATCGGCTGGGGCGG (SEQ. ID. NO: 5] 


S5 


F 


1000 


TTACTAGGGAAGCCGCTGGCA (SEQ. ID. NO: 6) 


S7 


F 


1400 


TCAGAGATTAGGATCGAAAAC (SEQ. ID. NO: 7) 


SS 


R 


1280 


GGATTGTATCGTGTGAGAAAG (SEQ. ID. NO: 8] 


S10 


R 


880 


AATGCCGGAAGCAGCCCCTTC (SEQ. ID. NO: 9; 


S13 


R 


280, 


CACGACAGGAAGA1 1 1 1GACT (SEQ. ID. NO: 1C- 



Materials and Methods 

Defined-primer based recombination t 

1. Preparation of genes to be recombincd. Plasmids containing the genes 4, 
to be recombtned were purified from trans formed TGI cells, using the Qiaprep \ 
kit (Qiagen, Chatsworth, CA). Plasmids were quanutated by UV absorption 

and mixed 1:1 for a final concentration of 50 ng/ul. 

2. Staggered extension PGR ar.d reassembly. 4 ^il of the plasmid mixture 
was used as template in a 100 standard reaction (1.5 mM MgClj, 50 mM 
KCl, lO'rnM Tris-KCi pH 9.0, 0.1% Triton X-100, 0,2 mM dNTPs, 0.25 U Taq 
polymerase (Promcga, .Madison, Vvl)) which aiso contained 12.5 ng of each of 
the S orimers. A control reaction which contained no primers was also 
assembled. Reactions were thermocycled through 100 cycles of 94°C, 30 
seconds; 55 3 C, 15 seconds. Chec ~g an aliquot of , the reaction on an agarose 
gel at this point showed the product to be a large -smear (with no. visible 
product in the no primer control). 

3. Dpnl digestion of the templates. 1 yA from the assembly reactions was 
then digested with Dpnl to remove the template plasmid; The 10 jjd Dpnl 
digest contained 1 x NEBuffer 4 ar.d 5 U Dpnl (both obtained from New 
England Biolabs, Beverly, MA) and was incubated at 37°C for 45 minutes, 
followed by incubation at 70'C for 10 minutes to heat kill the enzyme. 

4. PCR amplification :-f tine reassembled products. The 10 ul digest was 
then added to 90 -1 of a standard PCR reaction (as described in step 2) 
containing 0,4 uM primers 5b ( A CTT AA7 CT A G A G G GT ATT A) (SEQ. ID. NO: 1 1 1) 
and 3b (AGCCTCGCGGGATCCCCGGG) (SEQ. ID. NO: 12) specific for the ends 
of the gene. After 20 cycles of standard PCR (94°C, 30 seconds; 48 3 C, 30 
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seconds, 72°C ( 1 minute) a strong band of the correct size (2 kb) was visible 
when the reaction was checked on an agarose gel, while only a very faint band 
was visible in the lane from the no-primer control. The product band was 
purified and cloned back into the expression plasmid pNB106R and 
•' transformed. by electroporation into TGI cells. 

• Results 

Four 96 well plates of colonies resulting from this transformation were 
. assayed for pNB esterase initial activity and thermostability. Approximately 
60% of the clones exhibited initial activity and thermostabilty within 20% of 
the parental, gene values. Very few (10%) ofihe clones were inactive (less than 
10% of parent initial activity values). These results suggest a low rate of. 
■ mutagenesis. Four mutants with the highest thermostability values were 
sequenced. Two clones (6E6 and 6H1) were the result of recombination 
between the' parental genes (FIG. 5). One of the remaining, two clones 
contained a novel point mutation, and one showed no difference from parent 
5B12. The combination of mutations T99C-and C204T in mutant 6E6 is 
evidence for a recombination event between these two sites, in addition, 
mutant 6H1 shows the loss of mutation A1072G (but the retention of 
mutations C103BT and T1310C), which is evidence, for two recombination 
events (one be ween sites 102S and 1072, and another ' between 1072 and. ' 
1310). A total of five new point mutations were found in the four genes 
sequenced. • ' ■ 

EXAMPLE 4 

Recombination of two thermostable subtilisLn E .variants 
using internal defined primers and staggered extension 

This example demonstrates that the defined' primer recombination. 

technique can produce hovel sequences' containing new combinations of 

mutations present in the parent sequences. It further demonstrates the utility 

of the defined primer recombination technique to obtain further Improvements 

Ln enzyme performance (here, thermostability). This example further shows 

that the defined primers can bias tine recombination so that recombination 

"appears most often Ln the portion of the sequence defined by the primers 

(inside the primers). Furthermore, this example shows that specific mutations 

can be introduced into the recombineti sequences by using the appropriate 

defined primer sequence(s) containing the desired mutation(s). 
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• Genes encoding two thermostable subtilisin E variants, of Example 1 
(Rl and R2) were recombined using the defined primer recombination 
procedure with internal primers. FIG. 6 shows the four defined internal 
primers used to generate recombined progeny genes from template genes Rl 
and R2 in this example. Primer P50F contains a mutation . (A-*T at base 
position 598) which eliminates a Hindlll restriction site and simultaneously 
adds a new unique Nhel site. This primer is used to demonstrate that specific 
mutations can also be introduced into the population of recombined 
sequences by specific design of the defined primer. Gene R2 also contains a 
mutation A-+G at the same base position, which eliminates the Hindlll site. 
Thus restriction analysis (cutting by Nhel and Hindlll) of random clones 
sampled from the recombined library will indicate the efficiency of 
recombination and' of the introduction of a specific mutation via the mutagenic 
primer. Sequence analysis of randomly-picked (unscreened) clones provides 
further m formation on the recombination and mutagenesis events occurring 
during defined primer-based recombination. 

Materials and Methods 

Defined; primer based recombination 

A version of the defined primer based recombination illustrated in FIG. 
2 was carried out with the addition of StEP, 

1. • Preparation of genes ;o be recombined. About 10 ug of plasmids 
containing Rl and R2 gene were' digested at 37°C. for 1 hour with A'de/ and 
3amHl (30 U each) in 50 pi of lx buffer B (Bochringer Mannheim, 
Indianapolis, IN). Inserts of approximately 1 kb were purified from 0.8% 
preparative agarose gels using QLAZX II gel extraction kit. The DNA inserts 
were dissolved in 10 mM Tris-HCl (pH 7.4). The. DNA concentrations were 
estimated, and the inserts were mixed 1:1 for a concentration of 50 ng/ul. 

2. Staggered extension ?CR and reassembly. Conditions (100 ul final 
volume): about 100 ng inserts were used as template, 50 ng of each of 4 
internal primers, lx Taq buffer,. 0.2 mM of each dNTP, 1.5 mM MgCb and .25 
U Taq polymerase. Program: 7 cycles of 30 seconds at 94°C, 15 seconds at 
55 ? C, followed by another 10 cycles of 30 seconds at 94°C, 15 seconds at 
55°C, 5 seconds at 72°C (staggered extension), followed by 53 cycles of 30 
seconds at 94°C, 15 seconds at 55 3 C, 1 minute at 72'C (gene assembly). 

3. ■ Dpnl digestion of the templates. 1 pi of this reaction was diluted up to 
9.5 pi with dHjO and 0.5 pi of Dpnl restriction enzyme was added to digest the 
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DNA template for 45 minutes, followed oy incubation at 70'C for 10 minutes 
and then this 10 ul was used as template in a 10-cycle PCR reaction. 
4. PCR amplification of reassembled products. PCR conditions (100 ul 
final volume): 30 pmoi of each outside primer P5N and P3B, lx Taq buffer, 0.2 
v mM of each dNTP and 2.5 U of Taq polymerase. PCR program: 10 cycles of 30 
_ seconds at 94°C, 30 seconds at 55 # C, 1 minute. at 72°C. This program gave a 
single band "at the correct size. The product was purified and subcloned into 
• : pBE3 shuttle vector. This gene Library was amplified in E. coli HB101 and 
transferred into B. subtilis D3428 competent cells for expression and 
screening, as described elsewhere (35). Thermostability of enzyme variants 
was determined in the 96-weIl plate format.-described previously (33). 

DNA sequencing 

Ten E. coli HB101 transform ants were chosen for sequencing. Genes 
were purified using QLAprep spun piasmid miniprep kit to obtain sequencing 
quality DNA. Sequencing was cone c'r. an ABl 373 DNA Sequencing System 
using the Dye Terminator Cycle Sequencing kit (Perkin-Elmer, Branchburg, 
NJ). 

Results ; 
1 j restriction analysis: . 

Forty clones randomly picked from the rccombined library were 
digested with restriction enzymes- Nhel and SamKI. In a separate experiment 
the same forty piasmids were 'digested with Hindlll and BamRl. These 
rcacubn products were analyzed by gel electrophoresis. As shown in FIG. 7, 
eight out of 40 clones (approximately 20%) contain the newly, introduced jVhel 
restriction site, demonstrating that the mutagenic primer has indeed- been 
abic to introduce the specified mutation into the populauon. 

2) DNA sequence analysis ■ 

The first ten randomly picked clones were subjected to sequence 
analysis, and the results are summarized in FIG. 8. A minimum of 6 out of 
the 10 genes have undergone re::.mb:r.auon. Among these 6 genes, the 
minimal crossover events (reccmbinrit:::;) between genes Ri and R2 vary from 
" 1 to 4. All visible crossovers occurred within the region defined by the four 
primers. Mutations outside this region are rarely, if ever, recombined, as 
shown by the fact that there is no recombination between the two mutations 
at base positions 484 and 520. These results show that the defined primers 
can bias recombination so that i: appears most often in the portion of the 
sequence defined by the primers (inside the primers). Mutations very close 
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together also tend to remain together (for example, base substitutions 731 and 
745 and base substitutions 1141 and 1153 always remain as a pair). 
However, the sequence of clone 7 shows that two mutations as close as 33 
bases apart can be recombined (base position at 1107 and 1141). 

Twenty-three new point mutadons were introduced in the ten genes 
during the process. This error rate of 0.23% corresponds to 2-3 new point 
mutations per gene, which is a rate that has been determined optimal for 
generating mutant libraries for directed enzyme evolution (15). The mutation 
types are listed in Table 4. Mutations are mainly transitions and are evenly 
distributed along the gene. 

TABLE 4 



New poi 


at mutations identified 


in ten recombined 


genes 


Transition 


Frequency 


Transversion 


Frequency 


G — ► A 


■ 4 


- A->'T 


1 * 


A -> G 


4 


A -> C 


1 I 


C -> T 


3 


C -> A 


1 


T -v C 


5 


C -> G 


0 






G -> C 


• 1 






G -> T 


0 






T -» A 


3 . 






T -> G 


0 



A total of 9S60 bases were sequenced, t he nutation rate was 0.23% 



4) Phenotypic analysis 

Approximately 450 B. subtilis D3428 clones were picked and grown in 
SG medium supplemented with 20 ug/ml kanamycin in 95-well plates. 
Approximately 56% of the clones expressed active enzymes. From previous 
experience, we know that this level of inactivation Indicates a mutation rate on 
the order of 2-3 mutadons per gene (35). Approximately 5% clones showed 
double mutant ( N 1 S 1 D + N2 1 8 S) - lik c phenotypes (which is below the expected 
25% value for random recombination alone due primarily to point 
mutagenesis), (DNA sequencing showed that two clones, 7 and 8, from the ten 
randomly picked clones contain both N21SS and NISID mutadons.) 
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EXAMPLE 6 

Optimization of the Actinoplanes utahensis ECB deacylase by 
the random-priming recombination method 

' In this example, the method is used to generate short DNA fragments 
from denatured, linear^ double-stranded DNA (e.g., restriction fragments 
purified by gel electrophoresis; 22). The purified DNA, mixed with a molar 
excess of. primers, is denatured by boiling, and synthesis is then carried out 
using the Klenow fragment of E. coli DNA polymerase I. This enzyme lacks 
5'->3' exonuclease activity, so that the random priming product is synthesized 
exclusively by primer extension and is not degraded by exonuclease. The 
reaction is carried out at pH 6.6, where the 3'->S' exonuclease activity of the 
enzyme is much reduced (35). These conditions favor random initiation of 
synthesis,. - 

The procedure involves the following steps: 

1 Cleave the DNA of interest with appropriate restriction 
endonuclcase(s) and purify- the DNA fragment of interest by gel electrophoresis 
using Wizard PCR Prep Kit (Promcga, Madison, Wl). As an example, the 
Actinoplanes utahensis ECB deacylasc gene was cleaved as a 2.4 kb-kmg Xho. 
1-PshM fragment from the recombinant plasmid pSHPlOO. It was essential to 
linearize, the DNA for the subsequent cenaruralion step. The fragment was 
purif.cd by -agarose .gel electrophoresis -sing the Wizard PCR Pre? Kit 
(Promcga, Madison, Wl) (FiG.9, step (a)). Gel purification was also essentia! in 
order to remove the. restriction endonuciease buffer from the 'DNA, since the 
Mg 2 * ions make it difficult to denature the DNA in the next step. 

2 400 ng (about 0.51 pmol) of the double-stranded DNA dissolved 
in KiO was mixed with 2.75 ng (about 1.39 nmol) of dp(N)6 random primers. 
'.After immersion in '.boiling- water for 3 minutes, the mixture was placed 

immediately in an ice/ethano! bath. 

The size of the random priming products is an inverse function of the 
concentration of primer (33). The presence of high concentrations of primer is 
thought to lead to steric hindrar.ee. Under the reaction conditions described 

.here the random priming products are approximately 200-400 bp, as 
determined by electrophoresis through an alkaline agarose gel (FIG. 9 step b). 

3 Ten pi of 10 x reaction buffer [10X buffer: 900 mM HEPES, pH 
6.6; 0.1 M magnesium chloride, 10 mM dithiothreito!, and 5 mM each dATP, 
dCTP, dGTP and dTTP) was added to the denatured sample, and the total 
volume of the reaction mixture was brought u? to 95 p! with H,0. 
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4. Ten units (about 5 pi) of the Klenow fragment of E.coh DNA 
polymerase I was added. All the components were mixed by gently tapping the 
outside of the tube and were centrifuged at 12,000 g for 1-2 seconds in a 
microfuge to move all the liquid to the bottom. The reaction was carried out at 
22°C for 35 minutes. 

The rate of the extension depends upon the concentrations of the 
template and the four nucleotide precursors. Because the reaction was 
carried out under conditions that minimize exonucleolytic digestion, the newly 
synthesized products were not degraded to a detectable extent. 

5. After 35 minutes at 22°C, the reaction was terminated by cooling 

the sample to 0°C on ice. 100 pi of ice-cold HoO was added to the reaction 
mixture. 

6. The random primed products were purified by passing the whole 
reaction mixture through Centricon- 100 (to. remove the template and proteins) , 
and Centricon- 10 filters (to remove the primers and fragments less than 50 » 
bases), successively. Centricon filters are available from Amicon inc (Berverly, 1 
MA). The retentate fraction (abcu: S5 pi in volume) was recovered from . 
Centricon- 10. This fraction contained the desired random priming products 
(FIG. 9, step c) and was used for whole gene reassembly. 

Reassembly of the whole ger.e was accomplished by the following steps: 

l' For reassemblv by PCR, 5 yl of the random-primed DNA 
fragments from Centricon- 10, 20 pi of 2>: PCR prc-mLx (5-fold diluted cloned 
Pfu buffer, 0.5 mM each dNT?, 0.1U/pl cloned Pfu polymerase (Stratagene, La 

• JoMa.-CAD.-S-pl of 30Vo-'(v./v.) glycerol and 7 .pi of H 2 0,were mixed on ice.. Since., 
the concentration of the random-primed DNA fragments used for reassembly 
is the most important variable, it is useful to set up several separate reactions 
with different concentrations to establish the preferred concentration. 

' 2. After incubation at- 95°C for 6 minutes, 40 thcrmocycles were 
performed, each with 1.5 minutes at 95'C, 1.0 minutes at 55°C and 1.5 

• minutes + 5 second/cycle at 72'C. with the extension step of the last cycle 
- proceeding at 72°C for .10 minutes, in a DNA Engine PTC-200 (MJ Research 

Inc., V/aterxown, MA) apparatus without adding any mineral oil. 

3. 3 pi aliquots at cycles 20, 30 and 40 were removed from the 
reaction mixture and analyzed by agarose gel electrophoresis. The 
reassembled PCR product at 40 cycles contained the correct size product in a 
smear of larger and smaller sizes (see FIG. 9, step d). 
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The correctly reassembled product of this first PCR was further 
amplified in a second PCR reaction which contained the PCR primers • 
complementary to the ends of the template DNA. The 'amplification procedure 
was as follows: 

1. 2.0 ill of the PCR reassembly aliquots were used as template in 
100-ul standard PCR reactions, which contained 0.2 mM each primers of 
xhoF28 (5* GGTAGAGCGAGTCTCGAGGGGGAGATGC3*) (SEQ. ID. NO: 13) and 
pshR22 (5* AGCCGGCGTGACGTGGGTCAGC 3') (SEQ. ID. NO: 14), 1.5 mM 
MgCb, 10 mM Tris-HCl [pH 9.0], .50 mM KC1, 200 pM each of the four dNTPs, 
6% (v/v) glycerol, 2.5 U of Taq polymerase (Promega, Madison, WIJ and 2.5 U 
of Pfu polymerase (Stratagene, La Jolla, .CA)- % ' 

2. After incubation at 96 3 C for 5 minutes, 15 thcrmocycles were 
performed, each with 1.5 minutes at 95°C, 1.0 minutes, at 55°C and 1.5 
minutes at 72°C, followed by additional 15 thcrmocycles of 1.5 minutes at 
95°C, 1.0 minutes at 55°C and 1.5 minutes + 5 second/cycle at 72°C with the 

! extension step. of the last cycle proceeding at 72°C for .10 minutes, in a DNA 
Engine PTC-200 (MJ Research Inc., Watenpwn, MA) apparatus without adding 
any mineral oil. 

3. The amplification resulted in a large amount of PCR product . 
with the correct size of the EC3 deacylase whole gene (FIG. 9, step c). 

Cloning was accomplished as follows-: 

ltf -r'he PCR product of EC3 deacylase gene was digested with Xho I 
and Psh AI restriction enryrr.es, and cloned into a modified p-U702 vector. 

2. S. lividans TK23 protoplasts were transformed with the above 
Ligation mixture to form a mutant library. 

In situ screening the ECS .deacylase .mutants 

Each transformant within -the -..Si lividans TK23 library obtained as 
described above was screened for deacylase activity with an in situ plate assay 
method using ECB as substrate. Transformed protoplasts were allowed to 
regenerate on R2YE agar plates by incubation at 30'C for 24 hours and to 
develop in the presence of thiosu-cpton for further 48-72 hours. When the 
colonies grew to proper sL-.e, 6 ml of 45 S C purified-agaxose (Sigma) solution 
containing 0.5 mg/ml EC3 in 0.1 M -sodium acetate buffer (pH 5.5) was 
poured on top of each R2YE-agar plate and allowed to further develop for 18- 
24 hours at 37°C. Colonies surrounded by a clearing zone larger than that of a 
conu-ol colony containing wild-type recombinant plasmid pSHP150-2 were 
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indicative of more efficient ECB hydrolysis resulting from improved enzyme 
properties or improved enzyme expression and secretion level, and were 
chosen as potential positive mutants. * These colonies were picked for 
subsequent preservation and manipulation. 

HPLC assav of the ECB deacvlase mutants 

Single positive transformants were inoculated into 20 ml fermentation 
medium containing 5 pg/ml thiostrepton and allowed to grow at 30°C for 48 
hours. At this step, .all cultures were subjected to HPLC assay using ECB as 
substrate. 100 \i\ of whole broth was used for an HPLC reaction at 30°C for 30 
minutes in the presence of 0.1 M NaAcfpH 5.5), 10% (v/v) MeOH and 200 
pg/ml of ECB substrate. 20 pi of each reaction mixture was loaded onto a ■ 
PolyLC polyhydroxyethyl aspartamidc column (4.6 x 100 mm) and eluted by 
acetonithle gradient at a flow rate of 2.2 ml/min. The ECB-nucleus was 
detected at 225 nm. 

Purification of the ECB deacvlase mutants 

After the HPLC assay, 2.0 ml pre-culrures of ail potential positive 
mutants were eh en used to inoculate 50 -ml fermentation medium and allowed 
to grow at 30°C, 280 rpm for 95 hours. These 50-ml cultures were then 
ccntrifuged at 7,000 g for 10 minutes. The supernatants were re-cenrrifuged 
a: 16,000 g for 20 minutes. The supernatants containing the EC3 deacvlase 
mutant ennvrr.es were -stored at - 2CTC . 

'The suoematants from the positive mutants were further concentrated 
to -1/30 their original- volume with an -Am icon Til tration unit with molecular ■ 
weight cutoff of 10 kD. The resulting enzyme samples were diluted with an 
equal volume of 50 mM KH2PO4 (pH 6.0) buffer and 1.0 ml was applied to Hi- 
Trap ion exchange column. The binding buffer was 50 mM KH2PO4 (pH 6.0), 
and the elution 'buffer was 50 mM KruPCU (pH 6.0) and 1.0 M NaCl. A lincar 
gradient from 0 to 1.0 M NaCl was applied in 8 column volumes with a flow 
rate of 2,7 ml/mLn. The EC3 ceacyiase mutant fraction eluted at 0.3 M NaCl 
and was concentrated and buffer exchanged into 50 mM KH2PO4 (pH 6.0) in 
Amicon Centricon-10 units. Enr/me purity was verified by SDS-PAGE, and 
the concentration was determined using the Bio-Rad Protein Assay. 

Specific activity assav of the EC3 deacvlase mutants 

■ 4.0 \ig of each purified ECB deacylase mutant was used for the activity 
assay at 30 a C for 0-60 minutes in the presence of 0.1 M NaAc (pH 5,5), 10% 
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(v/v) MeOH and 200 ng/ml of ECB substrate. 20 pi of each reaction mixture 
was loaded onto a PolyLC polyhydroxyethyl aspartamide column (4.6 x 100 
mm) and eluted with an acetonitrile gradient at a flow rate of 2.2 ml/min. The 
reaction products were monitored at 225 nm and recorded on an IBM PC data 
acquisition system. The ECB nucleus peak was numerically integrated and 
used to calculate the specific activity of each mutant. . 

As shown in FIG.. .10, after only one round of applying this random- 
priming -based technique on the- wild-type ECB deacylase gene, one mutant 
(M16)from 2,012 original transformants was found to possess- 2,4 times the 
specifkactivity of the wild-type enzyme. FIG 11 shows that the .activity of 
M16 has been increased relative to that of, the wild-type enzyme over a broad 
. pH range. 

EXAMPLE 7 • 

Improving. the thermostability Bacillus subfiles subtilisin E 
using the random-sequence primer recombination method 
This example demonstrates the' use of various DNA polymerases for 
. pnrncr-bascd recombination. I: further demonstrates the stabilization of 
subtilisin E by recombination.. 

Genes Rl and P.2 encoding the two thermostable' subtilisin E variants 
! described in Example 1 were chosen as. the templates for recombination. 

(i)-Tcrast gene preparation > 1 

. - . .„M n r p-n-s-' Rl and R2 (FIG. 11) were 

Subtilisin E Lncrrr.o sialic mu.aru g-ub ^ r v ' 

subjected to random primed DNA synthesis. The ^So-b? fragment including 

45 nt of subtilisin E prosequer.ee. the entire mature sequence and 1 13 nt after 

the "stop codon were obtainedbv double digestion of plasmid P BE3 with Bam 

■ . HI and Nde 1 and purified from a 0.8% agarose gel using the Wizard PCR.Prcp • 

Kit (Promega. Madison, Wl).-1: was' essential to linearize the DNA for the 

■ subsequent denaturation step. Gel purification was also essential in order to 
remove the restriction endonucleasc buffer from the DNA, since the Mg^ ions 
make it difficult to denature the DNA in .the next step. 

(2) Random primed D/VA synthesis 

Random primed DNA synthesis used to generate short DNA fragments 
from denatured,' linear, double-stranded DNA. The purified B. subMs 
subtilisin E mutant genes, mixed with, a molar excess of pruners, were 
denatured by boding, and synthesis was then earned out using one of the 
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following DNA polymerases: the Klenow fragment of E. coli DNA polymerase I, 
bacteriophage T4 DNA polymerase and T7 sequenase version 2.0 DNA 
polymerase. 

Under its optimal performance conditions (29), bacteriophage T4 DNA 
polymerase gives similar synthesis results as the Klenow fragment docs. When 
T7 sequenase version 2.0 DNA polymerase (31, 32) is used, the lengths of the 
synthesized DNA fragments are usually larger. Some amount of MnCia has to 
be included during the synthesis in order to control the lengths of the 
synthesized fragments within 50-400 bases. 

* Short, nascent DNA fragments can also be generated with PCR using 
the Stoffel fragment of Taq DNA polymerase or Pfu DNA polymerase. An 
important consideration is to identify by routine experimentation the reaction 
conditions which ensure that the short random primers can anneal to the 
templates' and give sufficient DNA amplification at higher temperatures. We 
have found that random primers as short as dp(N)n can be used with PCR to 
generate fragments. • 

2.1 Random primed DNA synthesis -with the Klenow fragment 

The Klenow fragment of E. coli DNA polymerase [ lacks 5'->3" 
exo'nucleasc activity, so that the random priming product is synthesized 
exclusively by; o rimer extension and is not degraded by cxonuclease. The 
reaction was carried out at pH 6.6, where the 3'-o'cxonuclease activity of the 
enzyme is r much 'reduced (35). These conditions favor random initiation of 
synthesis. 

A.. - . 200. ng (about 0,7 pmol) of Rl DNA and equal amount of R2 DNA 
dissolved in H2O was mixed with 13.25 pg (about 6.7 nmol) of dp(N)6 random 
primers. After immersion in boiling water for 5 minutes, the mixture was 
placed immediately in an ice/cthanol bath. 

" The size of the random priming products is an- inverse function of the 
concentration of primer (30). The presence of high concentrations of primer is 
thought to lead to steric hindrar.ee. Under the reaction conditions described 
here the random priming products arc approximately 50-500 bp, as 

. determined by agarose gel electrophoresis. 
2. Ten ul of 10 x reaction buffer (1 Ox buffer: 900 mM HEPES, pH 6.6; 0.1 
M magnesium chloride, 20 mM dithiothreitol, and 5 mM each dATP, dCTP, 
dGTP and dTTP) was added to the denatured sample, and the total volume of 
the reacuon mixture was brought up to 95 pi with H2O. 
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3. ."' Ten units (about 5 pi) of the Klenow fragment of E.coli DNA polymerase 
I {Boehringer Mannheim, Indianapolis, IN) was added. All the, components 
were mixed by gently tapping the outside of the tube and were centrifuged at 
12,000 g for 1-2 seconds in a microfuge to move all the liquid to the bottom. 
The reaction was carried out at 22'C for 3 hours. 

The rate of the extension depends upon' the concentrations of the 
template and the four nucleotide precursors. Because the reaction was carried 
out under conditions that minimize exonuclebtytic digestion, the newly 
synthesized products were riot degraded to a detectable extent: 

After 3 hours at 22 0 C, the reaction was terminated by cooling the 
samole to O'C on ice, 100 pi- of ice-cold H,0 was added to the reaction 
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mixture. .• . 

.5.. The random primed products .were purified by passing the whole 
reaction mixture through Microcon- 100 (Amicon, Beverly MA), {to remove the 
template and proteins) and Microcon- 10 filters (to remove the primers and . 
fragments less than 40 bases), successively.. The retentate fraction (about 65 
P l in volume) was recovered.frorr. the Microcon- 10. This fraction containing the 
desired random priming products was- buffer-exchanged against PCR reaction 
• buffer with the new Microcon- 10 further use ih whole gene reassembly. 

2.2 Random primed DNA synthesis "with bacteriophage T4 DNA polymerase 

•. - ■ Bacteriophage T~ DNA .ooiymerase and the* Klenow fragment of E.coli 
DNA polymerase 1 are similar ".in that each possesses a 5'-3' polymerase ... 
and a 3'-5' c/.onuciease activity, The -exonuclcases activity ' of 
teriophage T4 DNA polymerase is more'than .200 times that of. the Klenow 
fragment. -Since it. does no", displace the short 'oligonucleotide primers from 
single-stranded' DNA templates (23). the 'efficiency of mutagenesis, is different 
from the Klenow fragment. • • , • ' . 

1. 200' ng (about 0.7 pmoi) iof Rl DNA. 'and'' equal amount of R2 DNA 
dissolved in M 2 0 was mixed with 1 3.25 .'ug (about 6.7 nmol) of dp(N)s random- 
.primer's. After 'immersion, in boiling water, for 5 minutes, the/mixture was 
placed immediately in an ice/.ethar.o! bath. The presence of high concentra- 
tions of primer is thought to lea'd to stcric hindrance. 

2. Ten pi of 10 x reaction buffer [lOx buffer: 500 mM Tris-HCl, pH 8.8:' 
150 mM (NH4)2SO4-.70 mM' magnesium chloride. 100 mM 2-mcrcaptoethanol. 
0.2 mg/ml bovine serum albumin and 2 mM each dATP, dCTP, dGTP and 
dTTP) was added to the denatured sample, and the total volume of the 
reacuon mixture was brought u? to 90 pi with H2O. 
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3. Ten units (about 10 pi) of the T4 DNA polymerase I (Boehringer 
Mannheim, Indianapolis, IN) was added. All the components were mixed by 
gently tapping the outside of the tube and were centrifuged at 12,000 g for 1-2 
seconds in a microfuge to move all the liquid to the bottom. The reaction was 
carried out at 37°C for 30 minutes. Under the reaction conditions described 
here the random priming products are approximately 50-500 bp. 

4. After 30 minutes at 37°C, the reaction was terminated by cooling the 
sample to 0°C on ice. 100 pi of ice-cold H7O was added to the reaction 
mixture. 

5. The random primed products were purified by passing *the whole 
reaction mixture through Microcon-100 (to remove the template and proteins) 
and Microcon-10 filters (to remove the primers and fragments less than 40 
bases), successively. The retentate fraction (about 65 pi in .volume) was 
recovcred'from the Microcon-10. This fraction containing the desired random 
priming products was buffer* exchanged against PCR reacuon buffer wi:h the 
new Microcon-10 further use in whole gene reassembly. 

2.3 Random primed DNA synthesis with the T7' sequenase v2.0 DNA 
polymerase 

Since the T7 sequenase v2.0 DNA polymerase lacks exonuclcasc 
activity and is highly processive, the average length of DNA -synthesized is 
greater than that of DNAs syr. Resized by the Klcnbw" fragment or DNA 
polymerase. But in the presence of proper amount of MnCh 'in the reaction, 
the sLze of the synthesized fragments can be controlled to less than 4 00 bps. 

1. -■ 200 ng (about. 0.7 pmoi). of .HI DNA. and .equal amount of R2 DNA 
dissolved -in H^O was mixed with 13.25 pg (about '6.7 nmoi) of dp(N)s random 
primers. After immersion in boiling water for 5 minutes, the mixture was 
placed immediately in an ice/ethanol bath. The presence of high concen- 
trations of primer is thought to lead to steric hindrance. 

2. Ten pi of 10 x reaction buffer (10X buffer: 400 mM Tris-HCl, pH 7.5; 
200, mM magnesium chloride, 500 mM NaCl, 3 mM MnCl?, and 3 mM each 
cLATP, dCTP, dGTP and dTT?) was added to the denatured sample, and the 
total volume of the reacuon mixture was brought up to 99.2 pi with H 2 0. 

3. Ten units (about 0.8 pi) of the T7 Sequenase v2.0 - (Amersham Life 
Science, Cleveland, Ohio) was added. All the components were mixed by 
gently tapping the outside of the tube and were centrifuged at 12,000 g for 1-2 
seconds in a microfuge to move all the liquid to the bottom. The reaction was 
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carried out at 22°C for 15 minutes. Under the reaction conditions described 
here the random priming products are approximately 50-400 bps. 

4. After 15 minutes at 22°C, the reaction was terminated by cooling the 
sample to 0°C on ice. 100 pi of ice-cold H2O was added to the reaction 
mixture. 

5. The random primed products were purified by passing the whole 
.reaction mixture through Microcon-100 (to remove the template and proteins) 
and Microcon-10 filters (to remove the primers and fragments less than 40 
bases), successively. The retentate fraction (about 65 pi in volume) was 
recovered from the Microcon-10. This fraction containing the desired random 
priming products was buffer-exchanged against PCR reaction buffer with the 
new Microcon-10 further use in whole gene reassembly. 

2.4. Random'primed DNA synthesis with PCR using the Stoffel fragment of 
Taq DNA polymerase . 

Similar to the Klenow fragment of £. coli DNA polymerase I, the Stoffel 
fragment of Taq DNA polymerase lacks 5' to 3' exonuclease activicy. It is also 
more thermostable than Taq DNA polymerase. . The, Stoffel fragment'has low 
processiviry, extending a primer an average of only 5-10 nucleotides before it 
dissociates. As a result of its lower proccssivity, it may also have improved 
fidelity. 

1. 50 ng -(about C.175 prr.o!) of Rl DNA and equal amount of R2 DNA 
dissolved in H:0 was mixed with 6.13 vg (about 1.7 nmol) of dp(N)i2 random 
primers. ■ - 

2. Ten pi of lOx reaction pre-mix [lOx reaction pre-mlx: 100 mM Tris-HCl, 
' pH 8.3; 30 mM magnesium chloride, 100 mM'KCl/and 2 mM each dATP r 

dCTP, dGTP.and dTTP) was added, and the total volume of the reaction 
"mixture was brought up to 99.0 pi with H^O. 

•3. After incubation at 96°C for 5. minutes, 2.5 units (about 1.0 pi) of. the 
Stoffel fragment of Taq DNA polymerase (Perkin-Elmcr Corp., Norwaik, CT) 
was added. Thirty-five thermocycics were performed, each with. 60 seconds at 
95°C, 60 seconds at 55°C and 50 seccr.ds at 72°C, without the extension step 
of the last cycle, in a DNA Engine PTC -200 (MJ Research Inc., V/atcrtown, MA) 
apparatus. Under the reaction conditions described here the random priming 
products are approximately 50-500 bp. 

4, The reaction was terminated by cooling the sample to 0°C on ice. 100 ul 
of ice-cold H2O was added to the reaction mixture. 
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5. The random primed products were purified by passing the whole 
reaction mixture through Microcon-100 (to remove the template and proteins) 
and Microcon-10 filters (to remove the primers and fragments less than 40 
bases), successively. The retentate fraction (about 65 pi in volume) was 
recovered from the Microcon-10. This fraction containing the desired random 
priming products was buffer- exchanged against PGR reaction buffer with the 
new Microcon-10 further use in whole gene reassembly. 

2.5 Random primed DNA synthesis with PCR using Pfu DNA polymerase 

Pfu DNA polymerase is extremely thermostable, and the enzyme 
possesses an inherent 3* to 5' exonuclease activity but does not possess a 
5'— >3' exonuclease acuvity. Its base substitution fidelity has been estimated to 

be 2 x 10" 6 . 

1, 50 ng (about 0.175 pmo!) of Rl DNA and equal amount of R2 DNA 
dissolved in H^O was mixed with 6.13mg (about 1.7 nmol) of dp(N)i2 random 
primers. 

2. Fifty pi of 2 x reaction pre -mix \2 x reaction prc-mL\: 5-fold diluted 
cloned Pfu buffer (Stratagene, La Jolia, CA), 0.4 mM each dNTPj, was added, 
and the total volume. of the reaction mixture was brought up to 99.0 pi with 

H2O. 

3. After incubation at 95'C for. 5 minutes, 2.5 units (about 1.0 of Pfu 
DNA polymerase (Stratagene, La Jolla. CA) was added. Thirty-five thermo- 
cycies were performed, each with 60 seconds at 95°C, 60 seconds at 55 3 C and 
50 seconds at 72°C, without the extension step of the last- cycle, in a DNA 
Engine PTC-200 (MJ Research Inc., V/atertown, MA) apparatus. Under the 
reaction conditions described here - the major random priming products are 
approximately 50-500 bp. - 

4. The reaction was terminated by cooling the sample to 0°C on ice. 100 
pi of ice-cold H2O was added to the- reaction mixture. 

5. The random primed products were purified by passing the whole 
reaction mixture through Microcon-100 (to remove the template and proteins) 
and Microcon-10 filters (to remove the primers and fragments less than 40 
bases), successively. . The retentate fraction (about 65 pi in volume) was 
recovered from the Microcon-10. This fracdon containing the desired random 
priming products was buffer-exchanged against PCR reaction buffer with the 
new Microcon-10 further use in whole gene reassembly. 
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(3) Reassembly of the whole gene 

1 For reassembly by PCR ; 10 pi of the random-primed DNA fragments 
from Microcon-10, 20 ul of 2 X PCR pre-mix (5-fold diluted cloned Pfu buffer, 
0.5 mM each dNTP, O.lU/pl cloned Pfu polymerase (Stratagene, La Jolla, CA]). 
15 pi of H2O were mixed on ice. • 
2. After incubation at 96'C for 3 minutes, 40 thermocycles were 
performed, each with 1.0 minute at 95'C, 1.0 minute at 55'C and 1.0 minute 
+ 5 second/cycle at 72'C, with the extension step of the last cycle proceeding 
at 72'C for 10 minutes, in a DNA Engine ■ PTC-200 (MJ Research Inc.. 
Watertown, MA) apparatus without adding any mineral oil. 
3. 3 pi aliquots at cycles 20, 30 and 40 were removed from the reaction 
mixture and analyzed by agarose gel electrophoresis. The reassembled PCR 
product at 40cycles contained the correct size product in a smear of larger 
and smaller sizes. ■ 

(4) Amplification 

The correctly reassembled product of this first PCR was further 
amplified in a -second PCR reaction which contained the PCR primers 
complementary to .the ends of the template DNA. 

L. - 2.0 pi of the PCR reassembly aliquots were used as template in 100-pl 
standard' PCR reactions, which contained 0.3 mM each' primers of PI (5' 
CCGAGCGTTGC • ATATGTGGAAG 3 ) (SEQ. ID. NO: 15) and P2 (5' 
, CGACTCTAGAGGATCCGATTC J) (SEQ. ID. NO: 16). 1.5 mM MgCb. 10 mM 
Tris-HCl [oH 9.0). 50 mM KC1..200 mM. each of the four dNi Ps, 2.5 U of Taq 
polymerase (Promega. Madison. Wi, USA) and 2.5 U of Pfu polymerase 
(Siratagene, La Jolla, CA). , 

2. After Lncubation at 96'C for 3 minutes, 15 thermocycles were 
oerformed, each with 60 seconds at 95°C, 60 seconds at 55'C and SO seconds' 
"at 72-C, followed by additional 15 thermocycles of 60 seconds at 95«C. 60 
seconds at 55°C and 50 seconds^ 5 second/cycle) at 72'C with the extension 
step of the last cycle proceeding at 72'C for 10 minutes, in a DNA Engme PTC- 
200 (MJ Research Inc., Watcrtowr, MA) apparatus without adding any mineral 

3. The amplification resulted in a large amount of PCR product with the 
correct size of the subtiiisin E whole gene. 

(5) Cloning 
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Since the short DNA fragments were generated with five difiercat DNA 
polymerases, there were five pools of final PCR amplified reassembled 
products. Each of the' DNA pool was used for constructing the corresponding 
subtiiisin E mutant library. 

1. The PCR amplified reassembled product was purified by Wizard DNA- 
CleanUp kit (Promega, Madison, WI), digested with Bam HI and Nde 1, 
electrophoresed in a 0.8% agarose gel. The 986-bp product was cut from the 
gel and purified by Wizard PCR Prep kit (Promega, Madison, WI). Products 
were ligated with vector generated by Bam Hl-Nde 1 digestion of the pBE3 
shuttle vector. 

2. £. coli HB101 competent cells were transformed wish the above ligation, 
mixture to form a mutant library. About 4,000 transformants from this library 
were pooled, and recombinant plasmid mixture was isolated from this pool. 

3. B. subtilis D3428 competent cells were transformed with the above 
isolated plasmid mixture 'to form another library of the subtiiisin E variants. 

4. Based on the DNA polymerase used for random priming the short, 
nascent DNA fragments, the five libraries constructed here were named: 
Hbrary/Klenow, library/T4, library/Scquenase, library/ Stoffel and library/ Pfu. 
About 400 iranformants from each library were randomly picked and 
subjected to screening for thermostability (see Step (7)]. 

(5) Random clone sequencing 

Ten random clones from the 3. subtilis D3423 library/ Klenow was 
chosen for DNA sccuer.ee analysis. ' Recombinant plasmids were individually 
purified- from R. subtilis D.3423 using a QlAprcp^ spin plasmid miniprcp kit 
(QIAGEN) with the modification that 2 mg/ ml lysozyme was added to PI buffer 
and the cells were incubated for 5 minutes at 37'C, retransformed into 
competent E. coli HB 101 and then purified again using QLAprep spin plasmid 
miniprep kit to obtain sequencing quality DNA. Sequencing was done on an 
ABI 373 DNA Sequencing System using the' Dye Terminator Cycle Sequencing 
kit (Perkin-Elmcr Corp., Norwaik, C7). 

■ (7) Screening for thermostability 

About 400 transformants from each of the five libraries described at 
Step (4) were subjected to screening. Screening was based on the assay 
described previously (33, 35), using succinyl-Ala-Ala-Pro-Phc-p-nitroanilidc 
(SEQ ID. NO: 25) as substrate. 3. subtilis DB428 containing the plasmid 

' library were grown on LB/kanamycm (20 ug/ml) plates. After 18 hours at 



WO 98/42832 



PCT7US98/05956 



-if 

37°C single colonies were picked into 96-well plates containing 100 pi 
SG/kanamycin medium per well. These plates were shaken and incubated at 
37 8 C for 24 hours to let the cells to grow to saturation. The cells were spun 
down, and the supematants were sampled for the thermostability assay/ 
Three replica 96-well assay plates were duplicated for each growth plate, with 
each well containing 10 ml of supernatant. The subtilisin activities were then 
measured by adding 100 ml of activity assay solution (0.2 mM succinyl-Ala- 
Ala-Pro-Phe-p-nitroanilide (SEQ. ID. NO: 25), 100 mM Tris-HCl, 10 mM CaCh, 
pH 8.0, 37 °C). Reaction velocities were measured at 405 nm over 1.0 min. in 
a ThermoMax microplate reader (Molecular Devices, Sunnyvale CA). Activity , 
measured at room temperature was used 'to calculate the fraction of active 
clones (clones with activity less than 10% of that of wild type were scored as 
inactive). Initial activity (Ai) was measured after incubating one assay plate at- 
65"C for -10 minutes by immediately adding 100 pi of prcwarmed (37'C) assay 
solution (0.2mM succinyl-Ala-Ala-Pro-Phe-p-nitroanilidi (SEQ. ID. NO: 25). 
100 mM Tris-HCl, pH 8.0. 10 mM CaClj) into each well. Residual activity (Ar) 
was measured after 40 minute incubation. 

' (8) Sequence Analysis 

After screening, one clone that showed the highest thermostability 
within the 400 transformar.ts from the library/ Klehow was re-streaked on 
L3/kanamycin agar plate, and single colonies .derived from this plate were 
inoculated into tube cultures, for glycerol stock and plasmid preparation. The 
recombinant plasmid was purified using a QIAprcp spin plasmid miniprep kit 
(QIAGEN) with the modification ;that 2 mg/ml lysozyme was added to PI buffer 
and the cells were incubated for .5 minutes at 37°C, rctransiormcd into 
' competent £. co.'i HB 101 and then purified again using QlAprep spin plasmid 
miniprep kit to obtain sequencing quality DNA. Sequencing was done on an 
ABI 373 DNA Sequencing System using the Dye Terminator Cycle Sequencing 
kit (Pcrkin-Elmer Corp., Norwaik, CT). 

Results 

.1. Recombination frequency and efficiency associated with the random- 
sequence recombination. 

The random pruned process was carried out as described above. The 
process is illustrated in FIG. 1. Ten clones from the mutant library/ Klcnow 
were selected at random and sequenced. As summarized in FIG. 12 and Tabic 
5, all clones were different from the parent genes. The frequency of occurrence 
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of a particular point mutation from parent Rl 'or R2 in the recombined genes 
ranged from 40% to 70%, fluctuating around the expected value of 50%. This 
indicates that the two parent genes have been nearly randomly recombined 
with the random primer technique. FIG. 12 also shows that all ten mutations 
can be recombined or dissected, even those that are only 12 bp apart. 

We then estimated the rates of subtilisin thermoinactivab'on at 65°C by 
analyzing the 400 random clones from each of the five libraries constructed at 
Step (5). The thermostabilities obtained from one 96-well plate are shown in 
FIG. 13, plotted in descending order. Approximately 21% of the clones 
exhibited thermostability comparable to the mutant with the N181D and 
N218S double mutations. This indicates that the N181D mutation from RC2 
and the N218S mutation from RC1 have been randomly recombined. 
Sequence analysis of the clone exhibiting the highest thermostability among 
the screened 400 trans form ants from the library/ KJenow showed the mutation 
N181D and N218S did exist. 

2. Frequency of newly introduce:: mutations during the random priming 
process 

Approximately 400 trans form ants from each of the five B.sublilis 
D3428 libraries (sec Step (5jj were picked, grown in SG medium 
supplemented with 20 ug/rr.l kar.amycin in 96 -well plates and subjected to 
subtilisin E activity screening. Approximately 77-84% of the clones expressed 
active enzymes, while 16-23% of the transformants were inactive, presumably 
as a result of newly intrc^'jccc mutations. From previous experience, \vc 
know that this rate of inactivation indicates a mutation rate on the order of 1 
to 2 mutations per gene (35). 

As shown in FIG. 12, IS new point mutations were introduced in the 
process. This error rate of 0.18% corresponds to 1-2 new point mutations per 
gene, which is a rate that has been determined from the Inactivation curve. 
Mutations are nearly randomly distributed along the gene, . 
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TABLE 5 

DNA and amino acid residue substitutions in the ten random 
clones from Library/Klenow 



Clone ff 


Position 


Base 

OUU>)ULUUU(i 


Substitution 
Tvne 


Amino Acid 
Substitution 


Substitution 
TvTae 


C#i 


839 


A-+C 


transversion 


Gly-»Gly 


synonymous 


C#2 


722 


A->G 


transition 


Ser-»Scr 


synonymous 


C#2 


902 


T->C 


transition 


Val-*VaJ 


synonymous 


C#2 ; 


1117 


C-+G 


transversion 


Ser->Ser 


synonymous 


C04 


809 


T-+C 


transition 


Asn-»Asn 


synonymous 




1098 


G-*C 


transversion 


Gly-*Ala 


non-synonymous 


G#4 


1102 


T-C 


transition 


Ala-*Ala 


synonymous 


C*6 


653 


C->A 


transversion 


His-»Iie 


n o n -syr.o nymou s 


Ctf6 


654 


A-r*T 


transversion 


His-*I!c 


no n- synonymous 


C*6 ■ 


657 


T-+C 


transition 


Val->Ala 


n o n - syno nymo u s 


C«6 


653 


A-+C 


Lransvcrsion 


' Va!-+Ala 


n o n - syr. o n ym o u s 


C*6 


1144 


A->G 


transition 


. A!a-*Ala 


synonymous 


C*6 


1147 


A-*G 


transition 


Ala-+Aia 


synonymous 


C*7 


478 


T-»C 


transition " 


Ilc-lIc 


synonymous 


C«9 


731 


• A-+G 


transition 


Ala->Ala 


spor.TOous 


C*9 


994 


A-G 


transition 


Val-»Val 


synonymous 


C*10 


1111' 


A — + G 


transition 


Gly-Cly 


synonymous 


CMO ■ 


1112 




transversion 


Thr-»Ser 


n o n - syr. o n ymo us 



The mutation types arc listed in TABLE 5. The' direction of mutation is 
clearly nonrandom. For example, A changes more often to.G than to either T 
or C. .All transitions, and in particular T.-C and A-G, occur more often than 
transversion. Some nucleotides are more mutable than others. Onc'G->C, one 
C-*G and one C-»A transversions were found within the 1 10 sequenced clones. 
These mutations were generated very rarely during' the ..error-prone PCR 
mutagenesis of subtiUsm (37). Random-priming process may allow access to a 
greater range of ammo acid substitutions than PCR-based point mutagenesis. 

It is interesting to note that a short stretch of 5' C GGT ACG CAT GTA 
GCC'GGT ACG 3' (SEQ. ID. NO: 15) at the position 646-667 in parents Rl and 
. R2 was mutated to 5' C GGT ACG ATT GCC GCC GGT ACG 3' (SEQ. ID. NO: 
17) in random clone C = 6. Since the stretch contains two short repeats at the 
both ends the newly introduced mutations may result from a splipped-stxand 
mispairing process instead of point-mutation only process. Since there is no 



iram 



e-shift, this kind of slippage may be useful for domain conversion. 
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3. Comparison of different DNA polymerase fidelity in the random-priming 
process 

During random -priming recombination, homologous DNA sequences 
are nearly randomly recombined and new point mutations are also 
introduced. Though these point mutations may provide useful diversity for 
some in iritro evolution applications, they are problematic recombination of 
beneficial mutations already identified previously, especially when the 
mutation rate is this high. Controlling error rate during random ■ priming 
process is particularly important for successfully applying this technique to 
solve in vitro evolution problems. By choosing different DNA polymerase and 
modifying the reaction conditions, the random priming molecular breeding 
technique can be adjusted to generate mutant libraries with different error 
rates. 

The Klenow fragment of E.coli DNA pojymerase 1, bacteriophage T4 DNA 
polymerase, T7 sequenasc version 2.0 DNA polymerase, the Stoffcl fragment of 
Taq polymerase and Pfu polymerase have been tested for the nascent DNA 
fragment synthesis. The activity profiles of the resulting five populations (sec 
Step (5)] are shown in FIG. 13. To generate these profiles, activities of the 
individual clones measured in the 95-well plate screening assay are plortcd in 
descending order. The Library/ Stoffei and Library/ Klenow contain higher 
percentage of wild -type or inactive sub til ism E clones than that of the 
Library/ Pfu. In all five populations, percentage of the wild-type and inactive 
clones ranges from 1 7-20%. 

EXAMPLE 8 

Use of defined flanking primers and staggered extension 
to recombine single stranded DNA 

This -example demonstrates the use of the defined primer recombi- 
nation with staggered extension in the recombination of single stranded DNA. 

Method Description 

Single-stranded DNA can be prepared by a- variety of methods, most 
easily from plasmids using helper phage. Many vectors in current use arc 
derived from filamentous phages, such as M13mp derivatives. .After 
transformation into ceils, these vectors can give rise both to a new double - 
- stranded circles and to a single-stranded circles derived from one of the two 
strands of the vector. Single-stranded ci r cles are packaged into phage 
particles, secreted from cells and can be easily purified from the culture 
supernatant. 
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Two defined primers (for example, hybridizing to 5 and 3 ends of the 
templates) are used here to recombine single stranded genes. Only one of the 
primers is needed before the final PCR amplification. Extended recombination 
primers are first generated by the staggered extension process (StEP), which 
consists of repeating cycles of denaturation followed by extremely abbreviated 
annealing/ extension step(s). The extended fragments are then reassembled 
into full-length genes by thermocycling-assisted homologous gene assembly in 
the presence of a DNA polymerase, followed by a gene amplification step. 

The progress of the staggered extension process is monitored by 
removing aliquots (10 ul) from the reaction tube (100 ul starting volume) at 
various time points in the primer extension and separating DNA fragments by 
agarose gel electrophoresis. Evidence of effective primer extension is seen as 
appearance of a low molecular weight 'smear' early in the process which 
increases in molecular weight with increasing cycle number. Initial reaction 
conditions are set to allow template denaturation (for example, 9VC-30 
second denaturation) followed by very 'brief annealing/ extension step(s) (e.g. 
55'C-l to' 15 seconds) repeated through 5-20 cycle increments prior to 
reaction sampling. Typically, 20-200 cycles of staggered extension arc 
required to generate single stranded DNA 'smears' corresponding to sizes 
greater than the length of the complete gene. - 

The ex-ocrimcntjaJ design is as in Example 1. Two thermostable 
subti'lisin E mutants- Rl and R2 gene arc -subcloncd into vector M 1 3m? 18 by 
'restriction digestion with EcpRi and 3amH!. Single stranded DNA is prepared 
as described (39). 

Two flanking primer based recombination . 

Two defined primers; ?5N (5'-CCGAG CGTTG CAT AT GT GGA AG-3' '■■ 
(SEQ. ID. 'NO: 18)/underl:ned sequence is Ndel restriction site) and P33 (5*- 
CGACT CTAGA GGATC C GATT C-3* (SEQ. ID.. NO: 19),, underlined sequence is 
■ BamHI restriction site), corresponding to 5' and 3' . flanking primers, 
respectively, are used for recombination. Conditions (100 ul final volume): 
0.15 pmol single-stranded DNA containing Rl and R2 gene (mixed at 1:1) arc 
used as template, 15 pmol of one fan king primer -(either P5N or P33), lx Taq 
' buffer, 0.2 mM of each dNTP, 1.5 mM MgCh and 0.25 U Taq polymerase. 
Program: 5 minutes of 95 = C, 80-200 cycles of 30 seconds at 94*C, 5 seconds 
at 55"C. The single-stranded DNA products of correct size (approximately 
lkb) are cut from 0.8% agarose gel after electrophoresis and purified using 
QLAEX II gel extraction kit. This purified' product is amplified by a 
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conventional PCR. Condition (100 ul final volume): 1-10 ng of template, 30 
pmol of each flanking primer, lx Taq buffer, 0.2 mM of each dNTP, 1.5 mM 
MgCb and 0.25 U Taq polymerase. Program: 5 minutes at 95°C, 20 cycles of 
30 seconds at 94°C, 30 seconds at 55'C, 1 minute at 72°C. The PCR product 
is purified, digested with Ndel and BamHI and subcloned into pBE3 shuttle 
vector. This gene library is amplified in £. coli HB101 and transferred into B. 
subtilis DB428 competent cells for expression and screening, as described 
elsewhere (35). Thermostability of enzyme variants is determined in the 96- 
well plate format described previously (33). 

This protocol results in the generation of novel sequences containing 
novel combinations .of mutations from the parental sequences as well as novel 
point mutations. Screening allows the identification of enzyme variants that 
are more thermostable than the parent enzymes, as in Example 1. 

As is apparent from the above examples, primer-based recombination 
may be used to explore the vast space of potentially useful catalysts for their ' 
"optimal performance in a wide range of applications as well as to develop or 
evolve new enzymes for basic stru era re -function studies. 

While the present specification describes using DNA-dcpendcrit \ DNA 
polymerase' and single-stranded DNA as templates, alternative protocols are 
also feasible for using single-stranded RNA as a template. By using specific 
■ protein mRNA as the template and RNA-dcpcndcnt DNA polymerase (reverse 
transcriptase) as the catalyst, the methods described herein may be modified 
to introduce mutations and crossovers into cDNA clones and to create 
molecular divcrsir/ directly "from the mRNA level to achieve the goal of 
optimizing protein functions. This would greatly simplify the ETS (expression- 
tagged strategy) for novel catalyst discovery. 

In addition to the above, the present invention is also useful to probe 
proteins from obligate intracellular pathogens or other systems where cells of 
interest cannot be propagated (33). 

Having thus described exemplary embodiments of the present inven- 
tion, it should be noted by those skilled in the art that the within disclosures 
are exemplary only and that various other alternatives, adaptations, and 
modifications 'may be mace ■ within the scope of the present invention. 
Accordingly, the present invention is not limited to the specific embodiments 
as Illustrated herein, but is only limited by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Frances K. Arnold 

Zhixin Shao 
Joseph A. Affholter 
Huimin Zhao . ' 
Lori Giver : . 

(ii) TITLE OF INVENTION: Recombination^ of Polynucleotide 
Sequences Using Defined or Random Primer Sequences 

(iiij NUMBER OF SEQUENCES : 25 ' ■ 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Oppenheimer Poms Smith 

(3) STREET: 2029 Century Park East, Suite 3300 

(C) CITY: Los Angeles *■■ 

. (D) STATE: CA . ' . n ' 

(E) COUNTRY: USA 

(F) ZIP: 90067 

(v) COMPUTER READA3LE FORM: 

(A) MEDIUM TYPE: Floppy dis>: / 
{3} COMPUTER: I3M PC compatible 

(C) OPERATING SYSTEM: Wir.do-'S 

(D) SOFTWARE: Microsoft Word 6.0 

(vi) CUR-RE * JT APPLICATION* DATA: " 
(A) APPLICATION 17JM3ER: 
(3) FILING DATE: • 
CC) CLASSIFICATION': 

( v i i J PRIOR APPLICATION* DATA: 

(A) APPLICATION' ' >~JM3E?. : 60/041,556 
(3} FILING DATE: -arch 25,-199'? 
tC) APPLICATION* NVM3ER: 50/045., 211. 
_ (Dh, FILINC^DA*^^ April _3 0j..^97. _ _ _ 

(E) APPLICATION* NUMBER: 50/045,256 

(F) FILING DATE:.. May 1.2. 1937. 

' (viii) ATTORNEY/AGENT INFOP.MATION : 

(A) NAME: Oldenkamp, David C\ 
{3) REGISTRATION NUMBER : ,29,421 \ 
• (C) REFERENCE/ DOC:3T NUMBER: 330187 -84 ^ ' 

t (ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: ( 310) 7 3 5 -.5 0 C 0 

(3)' TELEFAX: (310) 277 - 12 97 ^ ■ 

' "(2) INFORMATION FOR SEQ ID NO : 1: 

(1) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 22 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE : ol igonucl eotide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CCG AGC GTT GCA TAT GTG GAA G 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CGA CTC TAG AGG ATC CGA TTC 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 1 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAG CAC ATC AG A TCT ATT AAC 

(2) INFORMATION FOR SEQ ID NO:. '4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH : 21 nucleosides.' 

(B) TYPE: nucleotide^ 

(C) TOPOLOGY:' linear 

(ii) MOLECULE TYPE : oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-: 4: 
GGA GTG GCT CAC AGT CGG TGG 



(2) INFORMATION FOR SEQ' ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 21 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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TTG AAC TAT CGG CTG GGG CGG 21 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTA CTA GGG AAG CCG CTG GCA - 21 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 nucleotides 

(B) TYPE:' nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SZZ I 0 NO : 7 : 
TCA GAG ATT ACG ATC GAA AAC 2 1 

(2) INFORMATION FOR SEQ ID NO: S : 

, <i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 21 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY : linear 

(ii) - ■MOLECULE -TYPE-: o 1 i c cr.uc 1 eot ide -■ - - - -- 

(xi) SEQUENCE DESCRIPTION': SEQ ID NO: 3: 

GGA TTG TAT CGT GTG AGA AAG . ~1 

(2) INFORMATION FOR SEQ ID NO: 9: • 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 21 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide, 
(xi) SEQUENCE DESCRIPTION': SEQ ID NO: 9: 

T GCC GGA AGC A3 ^ C ^_ : 21 
2) INFORMATION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS: . 

(A) 'LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY : linear 

(ii) MOLECULE TYPE: oligonucleotide 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO': 10: 
CAC GAG AGG AAG ATT TTG ACT 

(2) INFORMATION FOR SEQ ID NO : 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 nucleotides _ ■" • 
(3) TYPE: nucleotide 
(C) TOPOLOGY : linear 

(ii) MOLECULE TYPE:, oligonucleotide 

(Xi) SEQUENCE DESCRIPTION: S£Q ID NO: 11: 
ACT TAA TCT AG A GGG TAT TA _ . 

(2) INFORMATION' FOR SEQ ID NO: 12: 

• (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 nucleotides 
(3) TYPE : nucleotide 
(C) TOPOLOGY: 1 ir.ear . 

'(ii) MOLECULE TYPE: oi icpnuc ieot ids 

(xi) SEQUENCE DESCRIPTION": SEQ'"lD NO:' 12 

AGC CTC GCG GG~ TCC CCG GG 

(2) INFORMATION FOR SEQ ID NO: 1 13: 

■ (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2S nucleotides 
' * (3) TYPE: nucleotide. 
(C) TOPOLOGY : linear 

(ii) MOLECULE TYPE: oligonucleotide 

(Xi) SEQUENCE DESCRIPTION: SI? ID NO: 1 

GGT AG A GCG ACT CTC GAG GGG GAG ATG C 

(2) INFORMATION FOR SEQ ID NO: 1-1: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 22 nucleotides 
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-(B) TYPE : nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AGC CGG CGT GAG GTG GGT CAG C 2 2 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SZQUZUCE CHARACTERISTICS: 

(A) LENGTH: 22 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCG AGC GTT GCA TAT GTG GAA G 



• (2) INFORMATION FOR SEQ ID NO : 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 nucleotides 

( B ) TYPE: nucleotide 

(C) TOPOLOGY: linear 

' ' ' ( i i ) MOLECULE TYPE: oligonucleotide 

Cxi) SEQUENCE DESCRI ?TION : SZQ ID NO: 15: 
CGA CTC TAG AGG ATC CGA TTC 



(2) -I-N-F-OR.".ATICN FOR Sz,Q 10 NO : . 17:. 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 2 2 nucleotides 
, (3). TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

' CGG TAC GCA TGT AGC CGG TAG C 



(2) INFOP.MATION FOR SEQ ID NO: IB: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : ' 2 2 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: oligonucleotide . 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 :■- 
' CGG TAC' CAT TGC CGC CGG TAC G 

(2) INFORMATION FOR SEQ ID NO.: 19: 

(i) SEQUENCE CHARACTERISTICS :. 

(A) , LENGTH : - 22 nucleotides ? ' 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ -ID NO: 19: 
CCG AGC GTT GCA; TAT GTG CAA G. 

(2.) INFORMATION FOR SEQ ID NO: 20: \ ,'. 

(i) SEQUENCE CHARACTERISTICS': 
(A) LENGTH: 21 nucleosides 
{B).'TY?E: nucleotide 
(C) TOPOLOGY : linear 

(ii) MOLECULE TYPE: oligonucleotide • 

(xi) .SEQUENCE DESCRIPTION: S EQ * 1 D ' NO : 2 6 : 

CCA CTC'TAG AGC AfC CGA TTC 



(2) IN* FORMATION F.O R. SEQ ID NO : 2 1 : /, . . " 

(i)' SEQUENCE 'CHARACTER- 3t ICS : 

(A) LENGTH : 13 nucleotides . . 

,, (3) TYPE : r.ucleot i'de ^ 

(C) TOPOLOGY : linear \. ' ' 

*(ii) MOLECULE TYPE: oligonucleotide , 

. (Xi) SEQUENCE DESCR I ?T ION : SEQ ID" NO: 21.: ; 

GGC GGA GCT AGC TTC GTA 



(2) INFORMATION FOR SEQ ID NO: -22: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 13 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE : oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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GAT GTG ATG GCT CCT GGC 



18 



(2) INFORMATION FOR SHQ ID NO: 23: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: oligonucleotide 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



CAG AAC 



ACC GAT TGA GTT 



18 



(2) INFORMATION FOR SEQ ID NO: 2 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: IS nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE : oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:, 24: 

ACT GCT TTC TAA ACC ATC IB 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SZQ'JZV.CZ CHARACTERISTICS: 
(A) LENGTH 4 a-.iho acids 
(3) TYPE: peptide 
(C) TOPOLCG: : linear 

(ii) -MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION*: SEQ ID NO: 25:. 
Ala Ala Pro Phe 
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, What is claimed is : 

1. A method .for making double-stranded mutagenized 
polynucleotides from at least one template polynucleotide wherein said 
mutagenized. polynucleotides has at least one nucleotide which is, different 
from the nucleotide at the same position; in said template' polynucleotide, said 

method comprising: 

a) conducting enzyme-catalyzed DNA polymerization synthesis from 
random-sequence , or defined-sequence primers in the presence of said 
template polynucleotide to form a DMA pool which' comprises short 
polynucleotide fragments and said template polynucleoude(s); , . 

■ b). denaturing said DNA pool into a pool of single-stranded fragments', 

c) ' allowing said single-stranded fragments to anneal, under annealing 
conditions, to form a pool of .annealed fragments; 

- d) incubating said pool of annealed fragments with polymerase under 
conditions which result in extension, of said double-stranded fragments to 
form a fragment pool, comprising extended .single-stranded fragments; .- 

e) repeating steps b) through" c) until said fragment pool contains said 
mutagenized .polynucleotides. 

, '■ 2. ■ A method for making double-stranded mutagenized 'polynuclco- . 
tides according to claim 1 wherein said jingle-stranded fragments have areas, 
of comolcmentarkv and- wherein said step of "incubating said pool of annealed 
fragments is conducted under conditions in which, the short polynucleotide 
strands or extended short polynucleotide strands of each of said annealed 
fragments prime each other to" fqrrn said fragment pool.. 

-3 A method for making .double-stranded' mutagenized. polynucleo- 
tides according to claim 1 wherein said step of incubating said pool of 
annealed ■ fragments is conducted in the presence of said template 
polynucleotide(s) to provide random repriming of said single-stranded 
polynucleotides and said template :: :-:;.mucieotide[s). 

4. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 1 wherein a: least one of said primers is a defined 
sequence primer. 
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5. A method for making double-stranded mutageniz-ed polynucleo- 
tides according to claim 2 wherein at least one of said primers is a defined 
sequence primer. 

6. A method for making double-stranded mutagenizcd polynucleo- 
tides according to claim 3 wherein at least one of said primers is a defined 
sequence primer. 

7. ' A method for making double-stranded rnutagenized polynucleo- 
tides according to claim 4 wherein said primer comprises from 6 to 100 
nucleotides. ; 

8. A method for making double-stranded rnutagenized polynucleo- 
tides according to claim 5 wherein said primer comprises from 6 to 100 
nucleotides. - 

9. A method for making double-stranded mutagenizcd polynucleo- 
tides according 'to claim 6 wherein said primer comprises from 6 to 100 
nucleotides. 

10. A method for making double-stranded mutagenizcd polynucleo- 
tides according to claim * wherein a: least one defined terminal primer is 
used. 

11. A method for making "double-stranded mutagenizcd polynucleo- 
tides according to claim 5 wherein at least one defined terminal primer is 
used. 

• 12. A method for making double-stranded mutagenizcd polynucleo- 
tides according to claim 6 wherein a: least one defined terminal primer is 
used. 

13. A method for making double-stranded mutagenizcd polynucleo- 
tides according to claim 1 wherein said primers are defined sequence primers 
exhibiting limited randomness at one or more nucleotide positions within the 
primer. 
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14. A method for making double -stranded mutagenized polynucleo- 
tides according, to claim 13 wherein said primers comprise from 6 to 100 
nucleotides. - 

15. A method for making double-stranded mutagenized polynucleo- ■ 
tides according to claim 13 wherein two or more defined primers- specific for , 
any region of the template are used. 

16. . A method for making double-stranded mutagenized polynucleo- 
tides according to claim 1 wherein said primers are' defined sequence primers " 
exhibiting limited 'randomness a: more than 30% of the nucleotide positions 
within the primer. . > 

17. Amcthod for making double-stranded mutagenized polynucleo- * 
tides according to claim 16 wherein said primers comprise from 6: to 100 

■ nucleotides. , 

IS. . A method* for making double-stranded mutagenized polynucleo- . 
tides according to claim 16 wherein two. or more defined primers specific for 
any region "of the template arc used. , 

19. A. method for making double-stranded mutagenized potynuclco : 
tides according to cla;m 1 wherein said primers are defined' sequence primers" 
cxhibiLing limited ■ randomness, at "more than 60% of the nucleotide positions 

.within the primer. ■ ... 

20. A method for making double -stranded mutagenized polynucleo- 
tides' according to claim. 19 wherein said primers comprise from 6 to 100 

' nucleotides. 

■ 21. A. method for. making double-stranded mutagenized polynucleo- 
tides according to claim 19 wherein, two or more defined primers specific for 
any regions of the template(s) are used. 

22. A method for makmg double-stranded mutagenized pplynucleo- 
tides according to claim 1 wherein said primers are random-sequence 
orimcrs. 
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23. . A method for making double-stranded mutagenized polynucleo- 
tides according to claim 22 wherein the lengths of said primers are from 6 to 
24 nucleotides long. 

24. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 22 wherein said template polynucleotide(s) are 
removed from said DNA pool after generation of said short polynucleotide 
fragments. 

25. A method for making double- stranded mutagenized polynucleo- 
tides according to claim 1 which includes the additional steps of isolating said 
mutagenized double-stranded polynucleotides from said DNA pool and 
amplifying said mutagenized double-stranded polynucleotides. 

26. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 25 wherein said mutagenized double-stranded 
polynucleotides are amplified by the polymerase chain reaction. 

27. A method for producing an enzyme comprising the steps of: 

a) inserting into a vector a double-stranded mutagenized 
polvnuclcotide made according to claim L to form an expression vector, said 
mutagenized oolynuclcotidc encoding an enryrr.e; 

b) transforming a host cell with said- expression vector; and 

c) expre zzlr.g the chr-Tnc encoded by said mutagenized 
polynucleotide. 



2S. A process for preparing double-stranded mutagenized 
polynucleotides from at least cne template polynucleotide, said mutagenized 
oolynucleotides having at least one nucleotide which is different from Lhc 
nucleotide at the corresponding position in said template polynucleotide, 
wherein said process composes: 

(a) performing cruryme-ca talked DNA polymerization from 
random-sequence or defmed-secucr.ee primers in me presence of said 
template polynucleoddc(s) to icrrr. a DNA pool containing short polynucleotide 
fragments and said template pol>mucleotide(s); 

(b) denaturing said DNA pool into a pool of both single-stranded 
fragment polynucleotides and sLngle-stranded template polynucleotides; 
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(c) allowing the single-straiided polynucleotides of said pool to 
anneal, under annealing conditions, to form . pool of double-stranded 

annealed polynucleotides; 

(d) incubating said pool of annealed polynucleotides with DNA 
polymerase under conditions which result in extension of said double- 
stranded polynucleotides to form a DNA pool containing extended double- 

•■ stranded polynucleotides; and 

(e) repeating steps (b) through (d) until said DNA pool 
containing extended double-stranded polynucleotides contains said 
mutagenized polynucleotides. 

29. The process according to claim 28 wherein said pool of single- 
stranded fragment polynucleotides and'single-stranded template polyr.ucleo- 
■ tides contain single-stranded fragment polynucleotides having regions 
' complementary to regions of other single-stranded fragment polynucleotides in ' 

said pool such that these fragment polynucleotides anneal to each other m 

step (c), and prime each other in step (d). 

30 The process according to claim 28 wherein said single- stranded 
template polynucleotides) annual to at leas: some of the single-stranded 
fragment polynucleotides, in step (c). so as to provide random rc-primmg of 
said single-stranded fragment polynucleotides in step (d). 

' 31 . A process for preparing double-stranded mutagenized 

ooiynucleoudes from -at least r„o template polynucleotides, said template 

oolvnucleotides including a Arst template polynucleotide and a second. 

Vrrolatc polynucleotide which differ from each other, said mutagenized 

•ooiynucleoudes having at least one nucleotide which is different from the 

nucleotide at the corresponding position in said Arst template polynucleotide 

i . oop nth-- nu-l-o'ade which is . different from that at the 
and at least one otn— nu^i-j — - 

• ■ *.^\a c«-r-^ '"TiDlat" polynucleotide, wherein said 

corresponding posiuon in said sec- ^mpia,- pu.ji 

process comprises: 

„„ rfn _;~o ^-.—-catalvzed DNA ' polymerization either 
(a) peno. .:.i.- 0 >-•—.• — 

j „ o„^,,pnr» s o- from at least one- deAned-sequencc 

from a set of random-sequence p. -..-.:> 

primer, upon said template polynucleotides under standard DNA polymeriza- 
tion conditions or under conditions resulting in only partial extension, to form 
a DNA pool containing polynucleotide fragments and said template 

oolvnucleotides; 
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(b) denaturing said DNA pool into a pool of both single-stranded 
fragment polynucleotides and single-stranded template polynucleotides; 

(c) allowing the single-stranded polynucleotides of said pool to 
anneal, under annealing conditions, to form a pool of double-stranded 

S annealed polynucleotides; 

(d) incubating said pool of annealed polynucleotides with DNA 
polymerase under conditions which result in full or partial extension of said 
double-stranded polynucleotides to form a DNA pool containing extended 
double- stranded polynucleotides; and 

10 (e) repeating steps (b) through (d) until said DNA pool 

containing extended double-stranded . polynucleotides contains t said 
mutagenizcd polynucleotides; 

provided that, when (1) standard DNA polymerization conditions are used in 
step (b) or (2) full extension is the result in step (d), if at least one defined- 
15 sequence primer is used, at least one such primer must be a non-terminal 

primer. 

32. The process according to claim 31 wherein said first template 
polynucleotide differs from said second template polynucleotide in at least two 
base pairs. 

33. The process according to claim 32 wherein said two base pairs 
arc separated from each other. 

34. The process according to ciaim 33 wherein said two base pairs 
are separated from each o th c r by at least about 15 base pairs. 
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FIG. 3 
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Nhel + BamHI 



Hindm + BamHI 



Nhsl + BamHI. 
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